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ABSTRACT. We find the asymptotic distribution of the muiti-dimensionai muiti-scaie and kernei 
estimators for high-frequency financial data with microstructure. Sampling times are allowed to 
be asynchronous. The central limit theorem is shown to have a feasible version. In the process, 
we show that the classes of multi-scale and kernel estimators for smoothing noise perturbation are 
asymptotically equivalent in the sense of having the same asymptotic distribution for corresponding 
kernel and weight functions. We also include the analysis for the Hayashi-Yoshida estimator in 
absence of microstructure. 

The theory leads to multi-dimensional stable central limit theorems for respective estimators and 
hence allows to draw statistical inference for a broad class of multivariate models and linear functions 
of the recorded components. This paves the way to tests and confidence intervals in risk measurement 
for arbitrary portfolios composed of high-frequently observed assets. As an application, we enhance 
the approach to cover more complex functions and in order to construct a test for investigating 
hypotheses that correlated assets are independent conditional on a common factor. 

Key words: asymptotic distribution theory, asynchronous observations, conditional independence, high-frequency data, 
microstructure noise, multivariate limit theorems 



1 Introduction 



The estimation of daily integrated variance and covarianc ^]has become a central building block in model 
calibration for financial risk analysis. Recent years have seen a tremendous increase in trading activities 
along with ongoing buildup of computer-based trading. The availability of recorded asset prices at such 
high frequencies magnifies the appeal of asset price models grounded on continuous-time stochastic pro- 
cesses which are a cornerstone of financial modeling since the seminal works by Black & Scho les| ( |l973| l 
and Heston ( 1993). Increasing observation frequencies makes it possible to consider efficient estimation 
from underlying statistical experiments. Rising demand for an advanced theoretical foundation thus gave 



birth to the field of statistics for high-frequency data, going back to the path-breaking work of Andersen & 
Bollerslev| ( [T998| , |Andersen et a/.| ( T200T][2003] l, and |Barndorff-Nielsen & Shephard| ( [200T]|2()02| i. 



This article contributes to this strand of literature by considering a continuous-time stochastic process, 
i. e. a continuous semimartingale X comprising current stochastic volatility models, observed on a fixed 
time span [0, T] at (n + 1) points of a discrete grid and by investigating asymptotics when the mesh size 
of the grid tends to zero. A natural estimator for the integrated variance of a process is the discrete version 
called realized variance or realized volatility. For a continuous semimartingale, this estimator is consis- 
tent, and it weakly converges with usual y/n-rate to a mixed normal distribution where twice the integrated 
quarticity occurs as random asymptotic variance (Barndorff-Nielsen & Shephard (2002), 
terl 



Jacod & Prot- 
Renyi ( 1963]) has been 



( 1998), Zhang (2001 )). Therefore, the concept of stable weak convergence by 
called into play to pave the way for statistical inference and confidence intervals. In our setting, stable 
convergence is equivalent to joint weak convergence with every measurable bounded random variabl^] 

1 More accurately known as integrated volatilities and covol atilities, but we here stick to the more heavily used terminology. 
2 For a discussion of the general case, see p. 270 of Jacod & Protter i 1998| . 
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and thus, accompanied by a consistent estimator of the asymptotic variance, allows to conclude a feasible 
central limit theorem. This reasoning makes stable convergence a key element in high-frequency asymp- 
totic statistics, and it completes the asymptotic distribution theory for a univariate setupj^] Yet, an apparent 
problem pertinent to applications is to quantify the risk of a collection of high-frequently observed assets. 
Suppose we wish to estimate the quadratic variation of a sum X\ + X2, both processes X\ and X2 docu- 
mented as high-frequency data and modeled by continuous semimartingales. The quadratic variation of the 
sum is the sum of integrated variances and twice the integrated covariance. For the latter, the derivation of a 
feasible central limit theorem is evident as for its one-dimensional counterpart. However, when estimating 
[X± + X2] with the sum of these estimates, we do not obtain a feasible asymptotic distribution theory for 
this combined estimator for free. This is due to the fact, that the single estimates are correlated. To deduce 
the asymptotic variance of the compound estimator, we are in need of a multivariate limit theorem involv- 
ing the asymptotic covariance matrix of the estimators. The first part of this article is devoted to that task 
and provides asymptotic covariances of covariance matrix estimators within prominent specific models for 
high-frequency data. 

The aspiration to progress to more complex statistical models in this research area, has again been mainly 
motivated by economic issues. First of all, in a multi-dimensional framework, different assets are usually 
not traded and recorded at synchronous sampling times, but geared to individual observation schemes. 
Employing simple interpolation approaches has led to the so-called Epps effect (cf. Epps ( 1979 )) that co- 
variance estimates get heavily biased downwards at high frequencies by the distortion from an inadequate 
treatment of non-synchronicity. In the absence of microstructure, the estimator by Hayashi & Yoshida 



(2005 ) remedies this flaw of naively interpolated realized covariances and a feasible central limit theorem 
has been attained in Hayashi & Yoshida ( j201 1[ ). 

For one-dimensional high-frequency data, increasing sample sizes are expected to render the estimation 
error by discretization smaller and smaller - which is clearly the case if we assume an underlain contin- 
uous semimartingale. Contrary to the feature of the statistical model, in many situations high-frequency 
financial data exhibit an exploding realized variance when the sampling frequency is too highj^] This ef- 
fect is ascribed to market microstructure frictions as bid-ask spreads and trading costs. A favored way 
to capture this influence is to extend the classical semimartingale model, where the semimartingale acts 
to describe dynamics of the evolution of a latent efficient log-price which is corrupted by an independent 
additive noise. Following this philosophy from |Zhang et al. ( 2005 I, several integrated variance estima- 



tors have been designed which smooth out noise contamination first. The optimal minimax conver gence 
rate for this model declines to n 1 / 4 , what is known from the mathematical groundwork provided by 



Gloter 



|& Jacod| ( |2001| ). This rate can be attained using the multi-scale realized variance by|Zhang|d2006l)r pre- 



averaging as described in |Jacod et al. ( 2009[), t he kernel estimator by Barndorff-Nielsen et al. (2008 ) or a 
Quasi-Maximum-Likelihood approach by Xiu (2010). Though the estimators have been found in indepen- 
dent works and rely on various principles, it turned out that they are actually quite similar and in a certain 
asymptotic sense equivalent which is clarified in Section [3]below. 

The approaches to cope with microstructure noise analogously carry over to the synchronous multi -dimen- 
sional setting. Recently, methods to deal with noise and non-synchronicity in one go have been estab- 
lished in the literature. In fact, to each of the abovementioned smoothing techniques one extension to non- 



synchronous observation schemes has been proposed. First, the multivariate realised kernels by Barndorff- 



Nielsen et al. ( 2011] ) using refresh time sampling are eligible to estimate integrated covariance matrices and 



guarantee for positive semi-definite estimates at the cost of a sub-optimal convergence rate. Ait-Sahalia 
et al. ( 20 1 0)l suggested to combine a generalized synchronization algorithm with the Quasi-Maximum- 



Likelihood approach. Eventually, a feasible asymptotic distribution theory for the general non-synchronous 
and noisy setup has been provided by Bibinger (2012 1 and Christensen et al.\\20\ 1 1 for hybrid approaches 
built on the Hayashi-Yoshida estimator and the multi-scale and pre-average smoothing, respectively. Al- 
though these estimators combine similar ingredients they behave quite differently, since for the approach 
in Bibinger] ( |2012| ) interpolation takes place on the high-frequency scale after smoothing is adjusted with 
respect to a synchronous approximation whereas Christensen et al. ( 201 1 1 suggest to denoise each process 



3 See Section 2 for definition and further discussion. 

4 This is usually seen with the help of a so-called signature plot, see |Andersen et q£|{2000) and also the discussion in Chapter 2.5.2 



of |Mykland & Zhang|(2012) . 
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first and take the Hayashi-Yoshida estimator from pre-averaged blocks which results in interpolation with 
respect to a lower-frequency scale. |Park & Linton ( 2012| l use Fourier methods on the same problem. 



Remarkably, when interpolations in the fashion of the Hayashi-Yoshida estimator are performed on 
the high-frequency scale their impact on the asymptotic discretization variance of the hybrid generalized 
multi-scale estimator vanishes asymptotically at the slower convergence rate in the presence of noise 0This 
reveals that non-synchronicity becomes less important in the latent observation model with noise dilution. 
In all four models, discretely observed continuous semimartingales with or without noise, synchronously or 
non-synchronously, we develop the asymptotic covariance structure of the respective estimation methods. 
We choose the realized covariance matrix, the multi-scale, the Hayashi-Yoshida and the generalized multi- 
scale estimator to establish the multivariate limit theorems. While the asymptotic covariances for the syn- 
chronous settings are found following a similar strategy as for the asymptotic variances, the most intricate 
challenge arises for non-synchronous sampling schemes. The asymptotic variance of the Hayashi-Yoshida 



estimator can be illuminated as in Bibinger (201 la I by a synchronous approximation and interpolations 



and hinges on an interplay of the two different sampling grids. For covariances, we consider two Hayashi- 
Yoshida estimates with generally four different sampling schemes. Nevertheless, utilizing an illustration 
with refresh times of pairs and quadruplets will reveal the nature of the asymptotic covariance. For the 
generalized multi-scale estimator in the most general setup we benefit again from the fact that interpolation 
effects fall out asymptotically. Yet, the effects by superposition with noise require a tedious notation to 
cover all possible sampling designs. In typical situations, as a completely asynchronous sampling design, 
only the signal parts will contribute to asymptotic covariances, and the multivariate distribution simplifies 
leading to a tractable general approach. 

Relying on the asymptotic distribution of the considered integrated covariance matrix estimators, we strive 
to design a statistical test for investigating hypotheses, if two processes have zero covariation conditioned 
on a third one. We end up with a feasible stable central limit theorem for the test statistic involving products 
of estimators and thus obtain an asymptotic distribution free test. This test which we call conveniently con- 
ditional independence test renders information about the dependence structure in multivariate portfolios 
and can be applied to test for zero covariation of idiosyncratic factors in typical portfolio dependence struc- 
ture models, as the one by Eberlein et a/.| ( |2008] l. In particular, we may identify dependencies between 



single assets not carried in common macroeconomic factors that influence the whole portfolio and disen- 
tangle those from correlations induced by market influences. 

The outline of the article is as follows. We start in Section|2]by discussing asymptotic covariance matrices 
for the realized covariances in a simple equidistant discretely observed Ito process setup. We proceed to 
statistical experiments with noise in Section [3] non-synchronous sampling in Section |4] and both at the 
same time in Section|5] In Section[6| the results are gathered to conclude feasible multivariate central limit 
theorems. The conditional independence test is introduced in Section [7] and after an empirical study of 
asymptotic covariances applied in Section [8] to high-frequency financial data. The proofs can be found in 
Appendix [A] 

2 The simple case: Asymptotic covariance matrix of realized covari- 
ances 

Assumption 1. Consider a continuous p- dimensional ltd semimartingale (ltd process) 

(1) 



X t =X + [ n s ds+ f a s dW s , t G R + , 
Jo Jo 



adapted with respect to a right-continuous and complete filtration (J-" t ) on a filtered probability space 
(f2, J 7 , (J^t), P) with adapted locally bounded drift process p, a p-dimensional Brownian motion W and 
adapted p x p' cadlag volatility process a. Suppose that a itself is an ltd process again, given by an 
equation similar to (fT). The processes a and W can be dependent, allowing for leverage effect. 

Since we have in mind financial applications in which X is assumed to represent an hypothetical un- 
derlying log-price process, we will call X efficient (log-price) process in this context. 



Similar findings were made for the two-scales estimator in Zhang 201 1 1, and for local likelihood in Bibinger et al. 2012 . 
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For the ease of exposition we restrict ourselves to p = 4, which suffices to reveal the general asymptotic 
covariance form. The target of inference is the integrated covariance matrix J Q T E s ds, over a certain fixed 
time span [0, T], where £ = aa T , with a from ([TJ. We denote in the following 



/ \ 

a 

V - 



,(12) ,(22) ,(23) ct (24) 

,(13) ,(23) ^(33) ct (34) 

r (14) _(24) (34) (44) 

U« L>S 



(2) 



in the four-dimensional setting. 



Assumption 2. The ltd process X from Assumption [7] with p = 4 is discretely observed at equidistant 
observation times iT/n, i G {0, . . . , n}^ 

We write 

AiX = Xir - X ji-pT ,l<i<n, and A\X = Xvr - X {l - j)T , 1 < i < n, 2 < j < i , (3) 

for the increments and for increments to longer lags, respectively, and analogously for the single compo- 
nents. 

The standard estimators for the integrated covariance matrix based on discrete observations considered in 
this section, but also the estimators designed to deal with noise and non-synchronicity below, will have 
variances hinging on the random volatility process a. This is the main motivation why for high-frequency 
asymptotics in the strand of literature on integrated variance estimation, stable weak convergence is in- 
herent as an essential concept]^] Stable central limit theorems allow for feasible limit theorems if the 
asymptotic variance can be estimated consistently and thus for statistical inference and confidence bandsj^] 
The stability of weak convergence with respect to T is established for all estimators considered throughout 
this article. In the sequel, we use the notation Var^( • ),Covs( • ) for random (co-)variances dependent 
on E. The stochastic limits multiplied with the convergence rate will be denoted AVAR and ACOV for 
asymptotic variance and covariance, respectively. 

Proposition 2.1. On Assumptions^and^forp — 4, the asymptotic covariance between realized covari- 
ances yields: 

Cn n \ f T 

V AiX^AiX^ , V AiX^ AiX^ )= T / (ct^'V^ + af^a^) ds , (4) 

for all k,l,r,q € {1,2,3, 4}. In particular, we have 



A generalization for non-equidistant sampling is covered by Proposition 4.1 in Section [4] From now 



on we express the general asymptotic covariances using indices 1, 2, 3, 4 as in the second formula above, 

6 The discussion in this section extends to data that are synchronous but mildly irreguar, cf. |Mykland & ZhangH2012) , Chapter 
2.7.1. 

7 Let Z n be a sequence of Af-measurable random variables, with Tt C \- We sa Y m at Z n converges stably in law to Z 
as n — > oo if Z is measurable with respect to an extension of X so that for all A 6 Tt and for all bounded continuous g, 
EI Ag(Z n ) — > ElAg(Z) as n — > oo. I a denotes the indicator function of A, and = 1 if A and = otherwise. In the case of 
no microstructure, X = Tt . If there is microstructure, X is formed as the smallest sigma- field containing Tt and also making the 
microstructure measurable. We refer to Jacod ( 1997 1, Jacod & Protter 1 1998 1, and Bibinger 1 201 la I for background information on 

stable convergence for this estimation problem. 

8 Stable convergence also permits the suppression of drift through measure change, see Section 2.2 of Mykland & Zhang 



which draws on Rootzen 1 1980 1. The device is similar to the passage to risk neutral measures in finance, going back to |Ros~ 



2009 



1976 



Harrison & Kreps[ fl979F , and Harrison & Pliska ( 1981). This mode of convergence also permits the localization of processes such 
as volatility, so they can be assumed bounded, see Chapter 2.4.5 of Mykland & Zhang 1 2012). 
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and obtain special cases by inserting '(1) = (2)' etc. Proposition 2.1 includes the well-known results that 

nVar s (j^(\X) 2 j -A 2T af ds 

for the realized variance in a one-dimensional setup, where the asymptotic variance hinges on the so-called 
integrated quarticity, and 

nVar E ^A.X^A^A T f\l + p 2 s )(a^a^f ds 

in a bivariate model with spot correlation process p s . Already in the two-dimensional model we addition- 
ally obtain asymptotic covariances between realized variances and the realized covariance 

^A t X^A l X^K^{A l X^)A^2T p s {a^fafUs. 

The key steps for proving Q are the approximation 

AjX » g (i - 1)T (Wir - W li-iyr) , (5) 
more precisely given in the Appendix [A] and the formula 

Cov (z®Z«\ zWzW) = S im S te + S ra E im . (6) 



for a multivariate normal Z ~ N(0, S) with covariance matrix (Ey). The right-hand side of |5]l is condi- 
tionally on Fu-\)T/n centered Gaussian and this finding will be helpful, since by the martingale structure 
of realized (co-)variances and estimation errors in the upcoming sections below, the asymptotic covariances 
are given as limit of the sequence of conditional covariances. Hence it will be possible to apply |6]) which 
is a special case of the general formula for moments from a multivariate normal by Isse rEs| ( f"l918| l. 



3 Inference for observations with microstructure noise 

Assumption* 2. The process X is observed synchronously on [0, T] with additive microstructure noise: 

Yi = X ti + e, , i = 0, . . . , n . 

The ti , < i < n, are the observation times and we assume that there is a constant < a < 1/9, such 
that 

S n =snp((t i -t i ^ 1 ),t ,T-t n )=o(n-^ 9 - a ^ , (7) 

stating that we allow for a maximum time instant tending to zero slower than with n~ l , but not too slow. 
The microstructure noise is given as a discrete-time process for which the observation errors are assumed 
to be i. i. d. and independent of the efficient process. Furthermore, the errors have mean zero, and fourth 
moments exist. 

Exact orders in (|7]i and below in ( fT8j ) and ( p5| ) arise from upper bounds of remainder terms after apply- 
ing Holder inequality. We keep to the notation 

AiX = X u - X u _ t and A\X = X u - X u _ } , 1 < i < n, 2 < j < i . 

Since notation varies between papers, note the correspondence to the other main form: 

AiX is the same as AX ti . 
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The covariance matrix of the vectors ej , < j < n, is denoted H and for p = 4 we set 



/ 



H 



Vl2 Vl3 



V12 m rj2i 

Vl3 V23 m 2 V34 
\ VU V24 V34 V4 2 ) 



(8) 



Note that an i. i. d. assumption on the noise is standard in related literature, an extension to m-dependence 
and mixing errors can be attained as in Ait-Sahalia et al. ( 201 1) . For notational convenience of asymptotic 
variances, we restrict ourselves to i. i. d. noise in this section - in the general asynchronous framework 
below asymptotic covariances of generalized multi-scale estimates are not affected by the noise. 
Increments in such a microstructure noise model 



A j Y = 



fi s ds - 



<J S dW s + €j 



e i-i 



are substantially governed by the noise, since the second addend is O p (Sn ) and the drift acts only as 
nuisance term of order in probability O p (S n ). For an accurate estimation of the integrated covariance matrix 
in the presence of noise smoothing methods are applied. We now discuss several main approaches and 
integrate them in a unifying theory. To this end, we show that two prominent methods are asymptotically 
equivalent. 



3.1 Asymptotic Equivalence of the Multi-Scale and Kernel Estimators 



For the estimation of integrated variance the following rate-optimal estimators with similar asymptotic 
behavior have been proposed in the literature: a multi-scale approach by Zhang (20 06), pre-averaging noisy 
returns first as in Jacod et al. ( 2009 1, the kernel approach by Barndorff-Nielsen et al. ( 2008} and a Quasi- 
Maximum-Likelihood-Estimator by Xiu (2010). We investigate the covariance structure of the multi-scale 
estimator explicitly, but since all these estimators have a similar structure as quadratic form of the discrete 
observations, analogous reasoning will apply to the other methods. In particular, we shed light on the 
connection to the kernel approach to profit at the same time from the considerations by Barndorff-Nielsen 
([2008 ) pertaining parametric efficiency and the asymptotic features of different kernel functions. The 



et a 



multi-scale estimator 



(multi) ^ n ^ 



(2) 



(9) 



and analogous for other components, arises as linear combination of subsampling estimators that are aver- 
aged lower- frequent realized covariances using^frequencies i = 1, . . . , M^ 12 \ 
For discrete weights a,, 1 < i < M n , with Xa=! a i — ^ an d 2i=*i( Q! '»A) = 0' tne expression 



2M5 



ti 



GM: 



(ti(l)-ti(0)) 



24M£ 



(h"(l) - h"(0)) , (10) 



adopted from Zhang ( 2006 1, with twice continuously differentiable functions h satisfying J Q xh(x) dx 



and Jq 1 h(x) dx = 0, gives access to a tractable class of estimators. The multi-scale frequencies are chosen 

Mn kl ^ = cus/n with constants Cki, (k, I) G {1, 2, 3, 4} 2 , minimizing the overall mean square err or to order 
n -i/4 The estimator is thus rate-optimal according to the lower bounds for convergence rates by |Gloter & 
Jacod] ( pOOT) and |Bibingerl ( |2011b| l. 

At the present day, it is commonly known that the nonparametric smoothing approaches to cope with noise 
contamination have a connatural structure and related asymptotic distribution. A prominent intensively 
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studied alternative to the multi-scale approach is the kernel estimator by Bamdorff-Nielsen et al. (2008 ) 



— — — (kernel) " 



+ (w ( £ A^A^y^ + A^yWA^ 2 )), (ii) 
h=i K nJ j=h+i 

with a four times continuously differentiable kernel & on [0, 1], which satisfies the following conditions: 



l r i r i 

2LU» / f«'^H2j„ / (allt\\2. 



R 2 (x)dx, / (X(a;)) 2 cfc, / (£"(x)) 2 dx }< oo, £(0) = 1, £(1) = it'(0) = &'(!) = 0. 



In the one-dimensional setup ( fTTj ) has the shape of a linear combination of realized autocovariances of 
the discretely observed process. The subsequent explicit relation between kernel and multi-scale estimator 
enables us to embed the findings about several kernels and the construction of an asymptotically efficient 
one for the scalar model provided by Barndorff-Nielsen et al. (2008 ). Since the multi-scale approach 



exhibits good finite-sample properties in the treatment of end-effects, it can be worth to road-test resulting 
transferred multi-scale estimators in practice. 

Theorem 3.1. For each kernel function & matching the assertions above, for the estimator defined in (JTTJ 
and the multi-scale estimator (|9]l with weights determined by (|T0]> and h — we have 

/ — (multi) — (kernel) \ 

nW[X(i),X(2)] T - [XW,XM] T +4^ 12 j^0, (12) 

as n —> oo, M n = H n — > oo. The term is due to the different impact of end-effects in \\\\ and |9| 
and, since the variance-covariance structure carries over to adjusted unbiased versions of the estimators, 
is not crucial for the relation of the asymptotic (co)variances. 

3.2 Asymptotic Equivalence of Adjusted Estimators 

The multi-scale and kernel estimators defined in (|9| and ( fTTj ) are sensitive to end-effects which is caused 
by the dominating noise component (which does not depend on ri). Due to end-effects, on Assumption ^ 
the estimators |9| and ( fTT) with weights determined by ( fTO] ) and corresponding kernels have a bias — 2/712 
and 27712, respectively. We here investigate a correction to each of the two types of estimator: 



Correction to Multi-scale: Follow Zhang (2006) by modifying the first two weights 



«i^ai + 2/n, a 2 ^ a% - 2/n, (a i ) 3 < i <M„ (ai)3<»<M„- (13) 
Correction to the Kernel estimator: 

n— 1 

multiplying the realized covariance in the first addend with . (14) 



This correction is different from the 'jittering' approach provided in Barndorff-Nielsen et al. (2008 
We call the adjusted estimators, respectively, 

(multi, adj) — ■ — ■ — (kernel, adj) 

[XW,XW] T and [XW,XW] T 

With these adjustments, we obtain the following direct equivalence of the two estimators. 
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Theorem 3.2. Under the assumptions of Theorem 3.1 for each kernel function M, we have 



(- (multi, adj) — (kernel .adj)\ 

[XW,XWY T -[XW,XWy T ' J^0, (15) 



00, M n = c mu iti^/n and H n = c ke 



Section 2.6 p. 1487-88 of Bamdorff-Nielsen et al. 2008 . 
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kernel 


Si 


cubic 
Parzen 
rth Tukey-Hanning 


1 - 3a; 2 + 2x 3 
(1 _ 6a; 2 + 6a; 3 )l {x < 1/2} + 2(1 - xfl {x>1/2} 
sin(f(l-xr) 2 


kernel 


first-order weights at 


cubic 
Parzen 
rth Tukey-Hanning 


12i 2 6i 
(M)3 (M) 2 

^ (3« - 12) for < < M/2 and ^ (12 - for i > M/2 

™-(i-HT~ 2 ((>--i)sin M 1 -^)H+^(^-i) r cos(7r(i--i r ) r )) 

2M 2 



Table 1 : Collection of important kernels and corresponding weights for the multi-scale (first order term). 



The extension from H n = M n in Theorem 3.1 to asymptotically of the same (optimal) order follows 
directly, by inserting the minimum in the transformations in the proof, and by elementary bounds for the 
remainder. 



Remark 1. (Dependent noise.) In the case of m-dependence it will be convenient to discard the first m 
frequencies and renormalize in d9j. The adjusted estimator is robust. 



Remark 2. (Strong representation.) The results in Theorems 3.1^3.2 are similar to other "strong repre- 
sentation" results in the high-frequency literature, such as in Zhang (2011) (see key equation (39) on p. 
41 ) and Mykland et al. (2012), Theorem 4. (The convergence is in probability, but is comparable to strong 
representation through a standard subsequence-of-subsequence argument.) 



3.3 Optimal choice of weights, and Asymptotic distribution 

The standard weights employed in Zhang (2006) and Bibinger (201 lb): 



12z 2 



G?' 



6?' 



,((MA 
I2i 2 



(12) 



((Mn 2) ) 2 



((Aft 



(16) 



(M, 



(12)> 



{m£ 2) ) 



(1+0(1)) 



minimize the asymptotic noise variance and lead to, as mentioned by Barndorff-Niel sen et al. | ([2008 ), the 
same asymptotic properties as for the kernel estimator ( fTTj i with a cubic kernel. However, as derived by 
Barndorff-Nielsen et al. (2008 1 there are kernels surpassing the cubic kernel in efficiency by shrinking the 
signal and cross parts of the variance while allowing for an increase in the noise variance and striving for 
the best balance of all three. A fourth term appearing in the asymptotic (co-)variance, see ( fTTj i below, 
induced by end-effects and noise, can be circumvented by their 'jittering' technique. Asymptotically, 
Tukey-Hanning kernels as listed in Table [T] combined with this 'jittering' attain the optimal asymptotic 
variance in the scalar case known from the inverse Fisher information in Glote r & Jacod| ( [2001| l. All weights 
(jTOf satisfy the relations 2~2iti a i = 1 an d Sf=i a « / '* = 0- Classical pre-averaging is asymptotically 
equivalent to the Parzen kernel. This linkage has been shown by Christensen et al. ( 2010) >; see also the 
discussion in Jacod et al. (2009) (Remark 1, p. 2255). At this stage, we derive the asymptotic covariance 
structure for the typically considered equidistant observations setup and we will extend this to irregular 
sampling below in our general non-synchronous model. 
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kernel 




£>" 


m a 




cubic 


12 


13/70 


6/5 


6/5 


Parzen 


24 


3/4 


151/560 


15/40 


1st Tukey-Hanning 


ir 4 /8 


ir 2 /l6 


3/8 


TT 2 /8 


16th Tukey-Hanning 


14374 


5.132 


0.0317 


10.264 



Table 2: Constants in asymptotic covariance for important kernels. 



Proposition 3.1. On the Assumptions^Rand *^with ti — iT/n,0 < i < n, the asymptotic covariance 
of the multi-scale estimates |9} with Aft 12 ^ = c\i s/n, M^ 34 ' = C34 \fn, c = min (C12, C34), and weights 
(JTOj, and by the equivalence also of the corresponding kernel estimates, is 

/ (jnulti) — — — — (7nulti)\ r 

ACOV^[X(D,X(2)] T , [I(3),XW] T j=2D a cTj (a^a^+ai^a^)ds 
+ 2 c- 3 (7/X37/24 + 77i4?723) + c^OH" jf (t7i 3 ^ 24) + ?724^ 13) + r^af 3 ) + 7723^ 14) ) ds (17) 

+ C" 1 ^ (?7l3?724 + 77l4?723) , 

with constants D a , Wj\ OTf depending on the specific kernel, see Table 2| 10 



A generalization for non-equidistant sampling is covered by Proposition |5.1| in Section [5] In this case, 
the first addend of ( fT7] i (signal term) hinges on a function ( [29) , while the other terms are analogous. In- 
serting '(1) = (2) = (3) = (4)' in the general formula (fTT]), we obtain the asymptotic variance of the 
one-dimensional multi-scale estimator as given in Zhang ( 2006) . Also, for (1) = (3) and (2) = (4), we 
have the asymptotic variance of the integrated covariance multi-scale estimator as given for the special case 
in which H is diagonal in Bibinger ( |2012) . 



4 Inference for non-synchronous observations in the Absence of Mi- 
crostructure noise 

This section is devoted to the estimation problem under sampling with different observation schemes in 
each component. Denote n = m + ?i2 + n.3 + 114, the total number of observations. For deterministic real 
sequences we introduce the notation a n ~ b n to express shortly that a n — 0(b n ) and b n = 0(a n ). For 
sequences of random variables we analogously use a n ~ p b n if a n = O p (b n ) and b n = O p (a n ). 

Assumption** 2. Components of the process X are observed discretely at times tf\o < i < ni,l = 
1,2,3,4, which follow sequences of observation schemes for which n\ ~ ni ~ 713 ~ n^, and with a 
constant < a < 1/3 it holds that 

^ = sup((^-^ 1 ),4°,T-4')) =0(n-^) . (18) 

Similarly as before, we denote componentwise 

AiA-W = X® - X^^ and A{X^ = X^ - X® , 1 < * < n h 2 < j < i , I e {1, 2, 3, 4} . 

'"General expressions for the constants 2) Q , 91?, and 9Jl a are the same as in|Zhang||2006| and|BibingerM2012|, since they 
apply equally in the special cases discussed there. In this paper, we do not focus so much on the theoretical expressions for constants, 
as our focus in the feasible CLT. 
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Under non-synchronous sampling the equitable estimator for the integrated covariance is the following 
generalized realized covariance by Hayashi & Yoshida (2005): 



(HY) ni ™ 2 

\XV>,XV)}„ =yyAilWA j l( 2 'L . , ((1)f p) K „<i),mu, (19) 

L ' JT / j / < 1 J [m\n(t\ ' ,t\ )>ma,x(t\_ 1 ,t^_ 1 )\ ' v y 

i=l j=l 

where the sum comprises all products of increments with overlapping observation time instants. This 
estimator is asymptotically unbiased, i. e. unbiased and UMVU in the absence of a drift term, and can 
be deduced as Maximum-Likelihood estimator in a model with deterministic function £ t as illustrated in 



Mykland ( 2012| l. If supposed that we have sequences of sampling schemes for which some characteristic 



features, specified in detail below, have a limit describing an asymptotic behavior of asynchronicity, a stable 
central limit theorem with optimal convergence rate y/n has been established and there are also feasible 
versions (cf. Hayashi & Yoshida (2 01 1 1 > and Bibinger ( |201 la| l). The asymptotic variance is in general larger 
than for Q in the synchronous case and hinges on functions capturing the superposition of the two sampling 
times designs. For this reason, the analysis of the covariance structure in a multi-dimensional setup gets 
more involved, since for a covariance we confront a superposition of four different sampling schemes 
instead of two. For a more illustrative description, we use the illuminative rewriting of the Hayashi- Yoshida 
estimator from Bibinger (201 la) with a synchronous approximation and an uncorrelated addend due to the 
lack of synchronicity. 

For this purpose we introduce the notion of next- and previous-tick interpolations: 

tt(s)= min (&d l) >a)w&t7(a)= max f#|t i W <s 
' »e{0,...,n l } V 1 1 ~ I 1 w ie{0,...,n,} V 1 1 - 

for I E {1,2,3, 4} and s G [0, T]. One way to rewrite ( fT9] l using telescoping sums is: 

(HY) J'ZlL / \ 

z— 1 x ' 



»2 



.7=1 



For the generalization of the idea of closest synchronous approximations define 

T 12 = max (t+(0), t+(0)) , T, 12 = T]\ + max (4 (Tl4),4 ^4)) , i = I, ■ ■ ■ ,N V _ 



T, 



■34 



(4(0),4(0)) , If = Tf\ + max (t+ 4(T^)) , i = 1, . . . , N- 



34 ■ 



The times T, 12 , T, 34 , are the refresh times from 



Barndorff-Nielsen et al. 



(201 1 ) built for each pair of pro- 



cesses and thus we will refer to these times, which coincide with the ones defined in a slightly different 
manner in 



Bibinger| ( |2011a[ ), as refresh times in the following. The notion of next- and previous-ticks will 

(T™\Tt > s) , 



be applied analogously as above to refresh times: 



T&s) = . min x (TP\TP > s) andT 3 +( S ) 

i£{0,...,N 12 } 



mm 

ie{o,...,N 3i } 



and T^(s),T 3 4(s) in the same fashion for s £ [0, T}. Writing X^\4 for X (l } I = 1, 2, and X ( S> 12 
for X } , 12 ., I = 1, 2, the Hayashi- Yoshida estimator fl9] > can be illustrated 



(HY) 



N 12 

E 

i=l 



I T 1 . 12 T 12 y I T 12 



F (2) - 
T. 12 , 



(20) 



where iV 12 < min(n 1 ,n 2 ) is the number of refresh times T} 2 . This illustration is particularly useful 
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Figure 1: Interaction of next-tick and previous-tick interpolation terms affecting the covariance. 



to decompose the Hayashi-Yoshida estimator in two uncorrelated addends - a fictional non-observable 
synchronous realized covariance and the error due to the lack of synchronicity: 

[IPU< 2 )] { " Y) = D 12 + A X2 (21a) 

with 

JV12 

D T 2 = (x^i 2 — X^ 2 ^J (x!^i2 — X^ 2 ^J , (21b) 

3=1 



JV12 

At = 'y ] (^X T i2 + — -^t 12 ) (^t 12 — X T i2 ^ + (^X T i2 Jr — X T i 2 ^j (^X T i2 — X T i2 ^ (21c) 
3=1 

Each increment in ( pO| i is the sum of three addends, the increment over the refresh time instant and 
a previous- and next-tick interpolated increment, where in each product only one of the next-tick and 
previous-tick increments is non-zero. 
The covariance is hence given by 

E [D^Df + A 12 Df + A T A D T 2 + A l2 Af} - E [D 12 ] E [Df] 
= E [D^D™] + E [AfAf] - E [D 12 ] E [£ 34 ] + o(l) . 

The products D^A^ and D^A^ 2 tend to zero in probability, since the expectation for the Brownian parts 
equals zero by one factor in each addend of an interpolated increment over an interpolation time instant 
disjoint to the instants of the other factors. The product D T 2 D T 4 comprises the refresh time sampling 
for the pairs of processes. It can be decomposed passing over to refresh times of refresh times (which 
are the refresh times of all four processes in the definition of |Barndorff-Nielsen et al.\\10\\\ ) in the way: 
D^Df = D 12M + A 12 ' 34 . The idea is the same as for the usual Hayashi-Yoshida estimator ( f20] > as we 
have two non-synchronous sampling designs by the sequences T 12 , < i < 7V 12 ,T ; 34 , < I < N34,, For 
this purpose define 

S Q = max (T 12 , T 34 ) , $ = + max (^(Si-i), ,i = l,...,N. 

Here, different as for the variance of one Hayashi-Yoshida estimator the term A^ 2 ' 34 , comprising interpo- 
lations from T 12 , < i < AT 12 , and Xf 4 , < / < 7V 34 , to the times S k , < k < N, will not contribute 
to the asymptotic covariance. The term has in general a non-zero expectation depending on tr' 12 )^ 34 ), 
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but is compensated by terms in E [-Dj 2 ] E [Dj^] . Therefore, two terms will constitute the asymptotic co- 
variance of two Hayashi-Yoshida estimators. A simple covariance term as in the presence of synchronous 
observations at the refresh times Sk, < k < N, and the product A^Ap 1 . For the latter all products of 
terms of the above given type including next- and previous ticks which overlap in time can contribute to its 
expectation. To quantify this for general observation schemes we consider an auxiliary structure founded 
on the partition of [0, T] in the refresh time increments (Sk — Sk-i), k = 1, . . . , N. For each increment 
(Sk — Sfc-i), k = 1, . . . , N, we consider the refresh times T 12 , Tj 34 , lain next outside the interval and 
within the interval, i. e. T^(Sk), T^(Sk), T^l(Sk), T^(Sk), where at least one of the pairs equals Sk and 
the same for iSfc_i. This is visualized in Figure[T]in which observations at pairwise refresh times are il- 
lustrated by dots and "missing" observations by circles. On the left-hand side arrows and bars and on the 
right-hand side colored rectangles mark the intervals where interpolations for pairs intersect and thus con- 
tribute to the covariance. Although we have no further interest what observations take place between the 
considered refresh times, in particular more pairwise refresh times of one pair can lie in this interval, inter- 
polation terms of those will not overlap with one of the other pair. We only need to focus on the depicted 
borders of the intervals and the correlation between adjacent steps. As visualized on the left side of Figure 
[T| next-tick interpolation terms at 5, and the last precedent refresh time of the other pair have a non-empty 
intersection and contribute to the covariance. It is also possible - as highlighted on the right side of Figure 
[T] - that the next-tick interpolation term at the subsequent refresh time of the other pair and at Si overlap 
and have a non-zero correlation. The same findings apply to previous-tick interpolation terms. Together 
with the covariance between next- and previous-tick steps at the same Si of adjacent terms, which can be 
quantified similarly as for the variance in Bibinger (201 la), we have captured all possible terms triggering 
the covariance by non-synchronous sampling. 

Definition 1. Define the deterministic sequences 

GN ^) = % E - 5 -i) 2 > {22 ^ 

Si<t 

Fu N (t) = ~ E (min(i+(^),<+(5,))-^) + (^-max(t-(^_ 1 ),i-(5 l _ 1 ))) (22b) 

S i+ i<t 

+ (Si - min (if (Si-i),t^ (Si-i))) + [Si - Si-x) 
+ (min (t+(Si),t+(S t )) - S,) + (S< - max (f^i-i),*! ($-0)) 
+ (S i -miD.(t2(Si-i),t^(Si- 1 ))) (Si — Si-i) , 

^ (mm (tf (Si),t+ (Si)) -Si) + (S i+1 -max (t^(S i+1 ), t^(S i+1 ))) + (22c) 

S i+ i<* 

+ (min (t+(Si),t+ (Si)) -Si) + (S l+1 -m & x(ti(S l+1 ),q(S l+1 ))) + , 



H% N (t) 



I$ N (t) = T E (S l ~m ax (T 1 - 2 (S l ),T 34 (S l ))) + (22d) 

S i+ i<t 

x (mm(max(t+(SO),#(Si), 

+ (min (r i2 (S , i_i),r 34 (S'i_i)) - max (if (Si_i), if (5i_i))) (min (T^(Si-i), T^(Si-i)) - Sj-i) 

+ (Si -max(Tf 2 (5 l ),r 34 (5 i ))) + 
x (min (max («+($,)), t+(S<), max (t+ (T+(5i)), t+ (T 3i (S t )))) - max (T+(S 4 ), T 3 +(S. t ))) + 
+ (min (T 12 (5 i _i),r 34 (S' i _i)) - max (if (Si-i), if (Si-i))) (min (Tj^(5i_i),2^(5i_i)) - Si-i) + 

where ( • ) + denotes the positive part, and analogous sequences ' N , H^t'^ > ^23 ' N of monotone increas- 
ing functions on [0, T]. These are called quadratic covariations of times in the sequel. 
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Assumption 3. Assume that for the sequences ( |22a| >, ( |22b[ ), \22c\ and \22d\ from Definition^the following 
convergence assertions hold: 

(i) G»{t) G(t),F£»(t) -> FH, F^ N (t) i^^W ^ HH{t),Hlt' N {t) -+ H%(t), 
^24 (*) ~~ * -^af COi ^23' (*) ~~ ^ -^23 (*) as N 00, where the limits are continuously differentiate 
functions on [0, T]. 

(ii) For any null sequence (hjy), hisr — O (iV -1 ) 

G N (t + h N ) - G N (t) 



h N 



->G'(t), (23a) 
F.!r I* + '>>)- Eir® . F u' (t)i (23b) 

, (23c) 





ft, AT 






- ftjv) 






/l at 













lit (*) , (23d) 

uniformly on [0,T] as N — > 00 and analogously for H^' , F^ ' N , I23' ■ The limiting functions are 
called asymptotic quadratic covariations of times in the sequel. 

Assumption [3] which generalizes the notion of a quadratic variation of time for the one-dimensional 
case by Zhang eFa/.| ( |2005] l, directly postulates that sequences of time instants appearing in the sequences 
of conditional covariances after applying Ito isometry and <|5j converge to some limit. This is a rather weak 
condition and necessary for the existence of asymptotic covariances. 

Proposition 4.1. On the Assumptions [7] *^^and^the asymptotic covariance of two Hayashi-Yoshida 
estimators is 

/ — - — (HY) . — - — . (HY)\ r T 

ACOv([A(i),X(2)] T ,[XW,XW] T )=Tj CW^VW+^VWjd! (24) 

+ t£ (Fg + + III') -PM 24) ds + T £ (< + Hi? + III') 4 23) 4 14) ds . 
For '(1) = (3)' and '(2) = (4)', we find the asymptotic variance of the Hayashi-Yoshida estimator as 



illustrated in 



Bibinger (2011a). In this case J 2 4 i ^23 ^23 are zero. 



5 The general case: Asymptotic covariance matrix of the generalized 
multi-scale estimates under Asynchronicity and Microstructure 

This section focuses on the general model - comprising non-synchronous sampling and noise perturbation 
- and an hybrid approach founded on a combination of the estimators from Sections [3] and |4] 

Assumption*** 2. The process X is observed non- synchronously with additive microstructure noise: 

Y % = X % + e f > 3 = 0, . . . , m, l € {1, 2, 3, 4} . 

j 

The sequences of observation schemes are regular in the sense that n\ ~ 7J2 ~ n>3 ~ "4 and with a 
constant < a < 1/9 it holds that 

S„ = sup((tf -t^.^r-tCO) =0(n-*'°-« ) . (25) 

The observation errors are i. i. d. sequences, independent of the efficient processes, centered and fourth 
moments exist. Noise components can be mutually correlated only at synchronous observations. 
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In the following we establish the asymptotic covariance matrix for the generalized multi-scale method 
from Bibinger ( |2011b|l and[B ibinger ( 2012| l. It arises as a convenient composition of the multi-scale real- 
ized ( co-) variance by | Zhang|(|2006| l from Section[3]and a synchronization approach inspired by the estima- 
tor by Hayashi & Yoshida (2005). Virtually we can think of an idealized synchronous approximation by 
refresh times, apply subsampling and the multi-scale extension to this scheme, and afterwards interpolate 
to the next observed values on the highest available frequency. Reviving the notation from Section |4j the 
generalized multi-scale estimator can be illustrated: 



M (12) 

£ 



N 12 

£ 



X 



(1), 



X 



(1):- 



)( 



X 



(2), 



X 



(2) - 



(26) 



This estimator crucially differs from the approach by Christensen et al. ( 201 l| l, which has the form of the 
traditional Hayashi-Yoshida estimator, but bound to a low-frequency scheme of pre-averaged observations 
over blocks of order ^/n high-frequency observations. The estimator ( |2"6j l relies more on the principle of 
the refresh-time approximation - but contrary to Barndorf f -Nielsen et a/.| ( |20TT} - we use pre- and next-tick 



interpolations such that the final estimator has no bias due to non-synchronicity. For this reason the article 



on hand can not accomplish a unified theory that is applicable to alternative approaches by Barndorff- 



Nielsen et al. (201 1|>, |Ait-Sah alia et aZ7|( |2010| l and|Christense n~e"f q/.| ( |201 1) , since unlike their roots from 
Section [3] they are not connatural any more. We consider ( p6| ) because the method attains a much smaller 
discretization variance in comparison to the one by Christensen et al. (201 1), is rate -optimal and a feasible 
central limit theorem is accessible from Bibinger (2012 1. 



Remark 3. (Identical results for kernel estimators.) Since equations (|7]l and ( |25[ ) are the same, it follows 
from Section [3] that our results on irregular sampling for the synchronous case, where the generalized 
multi-scale estimator ( |26| ) coincides with the original one in the following apply identically to kernel 
estimators. Furthermore, all results for the estimator ( |26| l apply to a generalized kernel estimator with 
pairwise refresh time sampling as in \26\ . 

Definition 2. For given sampling schemes ;0 < j < ni,l < I < p ~ 4 and r < ni, define the 
functional sequences 



rAj 



ec*) ■•= £ £ (*S° -^)E(^-4V^ 



(27) 



for each component and analogously for refresh times X*', < j < Nki, (k, I) £ {1, 2, 3, 4} 2 , introduced 
in Section\3\ 



T kl <t 9=0 



N, 



kl 



rAj 



(28) 



Denote Q5N,r{t) os the function build in the same fashion from the refresh times of all four observed com- 
ponents Sj , < j < N. 

The existence of a limit & of the sequence in Definition [2] is essential to establish an asymptotic distri- 
bution theory, since it dominates the terms that appear in the (co-)variances of the multi-scale and related 
estimators and contribute to the asymptotic (co-)variance, namely the following existing limit: 



D a (t) := lim 



N 



N 



S r <t 



AS r 



OLiOtk 



i,k=l 



rAiAk 

E 

q=0 



1 - 



1 - 



AS r 



(29) 



In the equidistant setup D a (t) = D a t with the constant D a found in Proposition 3.1 We will call the limit 
& in case of existence local asymptotic sampling autocorrelation (LASA). If we focus on the special case 
'(1)=(2)=(3)=(4)', convergence of |27) is assumed for the one component. 
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The preceding definition suffices to quantify the influence of non-equidistant synchronous schemes on 
the asymptotic properties of the multi-scale estimator, but to give a very general asymptotic covariance 
structure of the generalized multi-scale estimator in a transparent form, we can not avoid to introduce some 
tedious notation in the following. Readers interested mainly in the usual completely non-synchronous 
setup, where the asymptotic covariances of estimates involving different components only hinge on the 
signal parts, may proceed with Corollary |5.2| 

Definition 3. Depending on the sequences of sampling schemes, define the following sequences of func- 
tions: 



(30a) 



J — k — 



and in the same way (t) , (t) and S24 (t). Define in the case of existence for given Af^ 2 ^ , M^ 4 ^ with 
M N =mm{M% 2 \M$ i) ): 

N 12 N34 JAM^ 2) fcAM<f 4) 

^'il^^SS 5- 5- ( 1 {^(Tf)=t+(T^), i -(TP r )=t7(T?i <! )} ( 31a > 
j=0k=0 r=l q=\ 

+ l { i+ (T 12 )=t + (T 34 )iti - (T ^2 r)=i - (T 34 a)} 
N12 N 34 JAJ\4 12) fcAMf 1 

6 » = A m M rlM ^^^ ^ ^ (%M 2 )=*t( T r).* 2 -(^=*s(^)} (31b) 

j=0k=0 r=l q=\ 

+ 1 {ti{T} 2 )=tt(T^),t-(Tjl r )=t-(Tjt q )}) ' 
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lim M" 1 



«r'-inr'-i 



N- 



3=0 



k=0 



( 1 {t+(Tf)=t+(Tf), 4 +(T/ 2 )=t+(T^)} 



(31c) 



+ 1 



K -J )=*3 (^4 - *) & ( T «12 - i )=*4 ( T «34 - * ) > J ' 



M<->-lMlf 4 '-l 



j=0 k=0 



+1 



Assumption* 3. AssMme that for the sequences ( |27| l/ (|28|> from Definition^ and the sequences from Defi- 
nition^ the following convergence assumptions hold: 

(i) As N — > 00 and r — > 00 with r = o(N): <3ff >r (t) — > S5(i), for a continuous differentiable limiting 
function 25 on [0, T]. 

(ii) For any null sequence (Jin), fiN — O (iV -1 ): 

® N , r {t + h N ) -& N , r {t) 

h N 

uniformly on [0,T] as N — > 00. 

(Hi) S N 3 (t) -> 513, £u(t) -> #14 (t), Sg 3 (t) -> S23(t), -> 5" 2 4(i) fli iV -> 00, where the limits 

are continuously differentiable functions on [0,T]. 



<S'(t) (32) 
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(iv) For any null sequence (hp?), hjy — O (N 1 ): 



S 13 (t + h N ) - S 13 (t) , S u (t + Hn) - S u (t) , 

r > i 7 14 v J ' l" a J 

S^ 3 (t + Hn) - S 23 (t) , S^(t + Hn) - S 2i (t) , . 

r > A 23 (t) , r ► *24W . (33b) 

uniformly on [0,T] as N — > oo. 

Proposition 5.1. (9n f/ze Assumptions [7] ** ! |2]flnc/ f/;e asymptotic covariance of generalized multi- 
scale estimates ( |26| > w/f/i M^ 12 ' = c\ 2 \J N\ 2 — c\fN \ = 034^/^34 = c*y/N, c ~ min (c, c*), 
weights ( [T0l > J * gzVen fry 

[XM,XW] T ,[XW,XW] T )= 2cT J ( Da Y( s )(°i 13) ^ 4) +°i 14) °? 3) )d s 
+ a" 3 (6 2 ^ 2 |, Q? 7i377 2 4 + ©M.a^a) + cr 1 (elitliaV^ + &ll^l a Vl4V23) (34) 

+ c - 1 m a 



with the existing differentiable limiting function \29) hinging on ( |28| l. All constants Q , ©£ Q depend on 
the asymptotic proportion of sampling times where pairs of two components are recorded synchronously 
and the selected weights. 

In a synchronous setting clearly &\\ = <S\\ = &\\ = &\\ = 1, = £?f = 2015* (= 24 for the cubic 
kernel), = £23 _ (=6/5 for ±e cubic kernel) and ^ = s ^ = s ^ = = 1[o ^ and rjg 

coincides with (jT7j except the influence of irregular sampling. In particular, the asymptotic variance of the 
multi-scale estimator for synchronous non-equidistant sampling yields 

/ (multi)\ fT 

AVARMXW,XP)] T )^ 2cT j (^ Q )'( s )(^ 11)(J i 22) + (^ 12) ) 2 ) ds 

+ 2 m<? cr 3 {riinl + viz) + c-'m a J T (r/ 2 ^ 22 ) + t^ 11 * + 2 r^ 12 )) ds (35) 

+ c-^ (VW2+VI2) ■ 

Interestingly, in most situations if &\\ — &ff — 0, the noise part will vanish. Furthermore, we obtain the 
following important result for the completely non-synchronous case: 

(71 (k) 

Corollary 5.2. In the case that no synchronous observations take place: t\ ^ t - for all I ^ k and 
€ {0, . . . , n{\ x {0, . . . , rife}, or the amount of synchronous observations tends to zero as n — > oo, 
using the same notation as in Proposition 5.1 and on the same (remaining) assumptions, we have: 

[X^,XW] T , [XW,XW] T ' )= 2cT J {D a y(s)(<ri 1S) 4 M) +4 U) cri 23) )ds, (36) 
with the existing derivative of the limiting function \29\. 



Here, formula ( |36| l makes sense only for different components and we may not insert '(1)=(3)' or 
likewise as above. However, the relation is meaningful for the asymptotic covariance between integrated 
variance estimators. 

Remark 4. Our major focus is not on the theoretical limits © and of other sequences, since in the general 
case they are specified only as limits. We do not need these values, however, for inference, as we shall see 
in the next section on feasible inference. 
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Note that convergence of \21\ is the natural assumption to derive a central limit theorem for irregularly 
spaced (non- equidistant) observations already in the one-dimensional framework. It emulates the asymp- 
totic quadratic variation of time for realized variance to an asymptotic local autocorrelation of sampling 
time instants which constitutes the counterpart to the sum of squared time instants emerging in the vari- 
ance for subsampling and the other smoothing approaches. The only difference is that not directly the limit 
of \21\ will appear in the asymptotic variance, but some limiting function additionally involving specific 
weights ( the kernel). If we think of random sampling independent ofY, the structure of ( |27| > will be partic- 
ularly simple for i. i. d. time instants. Virtually, only the expectation will matter and we can apply the law of 
large numbers. Assuming ([32} is less restrictive than the assertion in Zhang (2006), i. e. sampling needs not 
to be close to an equidistant scheme in the sense that asymptotic quadratic variation of time converges to T 
at T. Remarkably, for the popular model of homogenous Poisson sampling independent of X with expected 
time instants T/n, the asymptotic variance §35) is the same as for equidistant observations. This emanates 
from the i. i. d. nature of time instants and the vanishing influence of the first addend 2T / (nr) in ( |27| l as 
r — > oo. The finite sample correction factor in ( |32| l for this Poisson setup is thus (r + 1) jr. 

At first glance the simple appearance of the covariance between generalized multi-scale estimates in 
the typical general setup where all observations are non-synchronous and in the presence of microstruc- 
ture noise is intriguing. The covariance hinges only on the discretization error as if we had synchronous 
observations at the refresh times Si, i = 0, . . . , N. The noise falls out of the asymptotic covariance on the 
assumption that observation errors at different observation times are independent. 

This feature constitutes another nice property of the generalized multi-scale method that not only the 
asymptotic variance of the estimator has a simple form, where the impact of non-synchronicity falls out in 
the signal part, but the asymptotic covariances are even simpler and particularly are not influenced by the 
superposition of the four underlying different sampling schemes. Here, we benefit by the fact that for the 
construction of |26} a smoothing method to reduce noise contamination is utilized which at the same time 
smoothes out interpolation effects and hence the error due to non-synchronicity. 



6 A feasible multivariate stable central limit theorem 

In the sequel, we conclude a feasible stable central limit theorem for linear combinations of estimated 
entries of the integrated covariance matrix with estimators of the types considered above. A remaining 
step towards a feasible asymptotic distribution theory allowing to draw statistical inference, is to provide 
consistent estimators for the asymptotic covariances from Propositions |2 . 1 1 [3~Tj |4. 1 1 and |5 . 1 1 In the vein of 
Bibinger (201 2}, we construct consistent asymptotic covariance estimators in the general non-synchronous 



framework, following a histogram-type approach. For the simple case in the absence of noise and non- 
synchronicity, we give a simple-structured estimator resembling the prominent bipower variation. 

Proposition 6.1. In the setting of Section^ the estimator 

(n n \ 

i=l i—\ ) 

n-1 

= fJ2 KX^A^X^A^A^X^ + A l+1 X^A,X^A t X^A l+1 X^ , (37) 
is a consistent estimator of the general asymptotic covariance according to 
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On the assumptions imposed in Proposition 5.1 the estimator 

____ / — (multi) (rnulti)\ 

ACOVMX«,X( 2 )] T , [XW,XW] T 

A[Xi^( 3 )]\[X(2\XW] n + A[XW^( 4 )]"[X(2Tx(3)]" D%(T) 

+ c- 3 (e^^Mu + efMlaVums) + c- 1 (e?|£?f )Q 7)i37)24 + &ilhl, a vums) (38) 



K^ V23 ^ A^ K n '• 



gives a consistent histogram-wise estimation of the general asymptotic covariance from ( |34| l. Here we use 
Df := inf {t G [0, T]\D a (t) > jD a (T)/K N }, < j < K N , A£>f - £>f - , 

:= inf {t G [0,T]|5 /fc (i) > jS lk (T)/K N }, < j < tf^Z = 1,2, A; = 3,4, 

Ai^ = (5^), - (S^-i, 1 < j < K N , 

NS kl (T) 

fa = ~(N SuiT))- 1 J2 A i X«A m xW,fc = l,2,Z = 3,4, 

i=l 

ant/ empirical realizations for all functions based on the sampling design from Definition^ The number 
of bins Kn satisfies K^N^ 1 / 3 — > as Kn — > oo. The binwise evaluated estimators in ( |38| > are multi- 
scale estimators on each bin with multi-scale frequencies Mjv(j), 1 < j < Kn. One possible choice is 

K N = cN 1 / 5 and M N (j) = c 5 / 4 7V 3 / 5 . 



Remark 5. The feasible CLT remains valid when relaxing Assumptions of Proposition [377] on existence 
of the limit © in Assumption ^ since every subsequence of ( |29| l has an in probability converging subse- 
quence, see the discussion at the end of p. 1411 in Zhang et al. \2005 \ for analogous reasoning and more 



details. 

The estimator ( (35) simplifies in many cases, i. e. the completely non-synchronous setup, to the first 
addend. An estimator for the asymptotic covariance ( fP7) of multi-scale estimators in the synchronous 
case is inherent in ( [38) when inserting appropriate constants. One estimator for the non-noisy but non- 
synchronous model and the asymptotic covariance given in |24) may be constructed in a similar fashion as 
( |38) with histogram-type estimators based on equispaced grids with respect to the quadratic covariations 
of times introduced in Definition [T] Since the principle is clear from the above given estimator, we forego 
to explicitly state that estimator. In the synchronous setup or for I — k, the noise (co-)variance can be 
estimated ^/nl consistently by the realized (co-)variance or using adjacent increments as stated above. 
For '(1) = (2) = (3) = (4)', our estimator ((37) bec omes 2nY^{A i X) 2 {A i+1 X) 2 and dif fers from 
the standard estimator (2n/3) ^(AjX) 4 proposed in [Barndorff-Nielsen & Shephard| (2002) which is 
preferable because its variance 42 2/3n~ 1 is slightly smaller than 48n -1 of (|37[) for this case. The constant 



c in Proposition 6. 1 is fixed from the minimum constant for the multi-scale estimates here. For an algorithm 



how to select the tuning parameter adaptively involving pilot estimates to calibrate the whole estimation 



procedure first, we refer to Bibinger (2012) for the generalized multi-scale and Barndorff-Nielsen et al. 



(2008 ) for the univariate kernel estimator. 
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For a quadratic symmetric (p x p) matrix A E sym(p) C W xp , we denote the mapping to the vector of 
its p(p + l)/2 free entries 

SVEC(^L) = ((A r «j)i< r < Pi i< g < r ) T 

= (^li, ^12, ■ ■ ■ ! Ai p , A22, A23, . . . , A2 P , . . . , Ap^ip, Ap P ) T . 

Theorem 6.1. Denote \X^,X^] ,l<k<p,l<l<p, one of the integrated covariance matrix 



estimators from Sections 



5 and consider the vector SVEC ( \X( k \ XW] J ) of estimated 

\\ ) l<k<p,l<l<pj 

entries. The estimators futfui a multivariate stable central limit theorem with rate r n : 



r n SVEC ( \X( k ),X( 1 )} - \X^ k \X^\ 



stably . 



N (0, COV) , (39) 

with a symmetric p(p + l)/2 x p(p + 1) / '2- dimensional asymptotic covariance matrix COV determined 



by Propositions 



2.1 



3.1 



4.1 



and 



5.1 



respectively. The rate r n equals s/n, n = ^Zf=i n i> !W ^ e non-noisy 



experiment and n 1/4 under microstructure noise. For linear combinations 



Z 



w/f/z f/ze consistent estimators 

AVAR(Z) = ^ CfcCjQCfACOV ( , ) , 



k.k,Ll 



if the latter is strictly positive, a feasible central limit theorem holds: 

>) 



r„(Z/VAVAR(Z)V^N(0,l). 



(40) 



4.1 



Not e tha t we rescale the entries of the asymptotic covariance matrix COV from Propositions |2. 1 1 [3~Tj 
respectively, and its estimates with factors rjv /r n [(X /n)^ 1 / 2 ^ without and (iV/n)^ 1 / 4 - 1 with 



and 



5.1 



noise) to obtain ( |39| l and ( |40j >, where iV denotes the number of refresh times of the involved components. 
The multivariate central limit theorem (|39b is directly derived from the multi-dimensional version of The- 



orem 2-1 by |Jacod (T997J) which provides the basis for stable limit theorems in this research area. By 
virtue of the asymptotic covariance structure deduced in this work, it suffices to check the conditions that 
covariations of the componentwise estimation errors with the underlying reference Brownian motion W 
driving our efficient process tend to zero, as well as the covariations with all J^-adapted bounded martin- 
gales orthogonal to W. The proof by establishing elementary bounds for the variances of these terms is 
along the same lines as the univariate analysis, see |Bibinger ( 2012| l and Christensen et al. (2011 1 
others. Suppose without loss of generality for notational convenience that we have 



among 



(X n ,Y n ) st -^ v (X,Y)±N[0, 



V x 

CxY 



CxY 

V Y 



By continuous mapping we conclude X n +Y n — > X + Y stably. Assume we have at hand consistent 
estimators V£ -A V x ,Vp -A V Y , C XY -A C X y, and set V n = V% + V$ + 2C XY which is assumed 
to be strictly positive. As V n is a bounded ^-measurable random variable, stable convergence implies 



Y n ,V r ' 



weakly 



(X + Y, Vx + V Y + 2C XY ) and hence also (X n + Y^/VV 1 



■ weakly 



N(0,1). 



7 An application for conditional independence testing 

This section is devoted to the design of a statistical test in order to investigate if the correlation of two assets 
is only induced by a third one to which both are correlated. For multi-dimensional portfolio modeling and 
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management, information about such relations can provide valuable information and access to a new angle 
on the covariance structure. Conclusions in the way that a significant integrated covariance between two 
high-frequency assets might be fully explained by their dependence on a joint factor or another asset, 
respectively, will be of revealing impact on the used models. One interesting case we may think of is that 
two observed asset processes X± and X2 are listed within one index recorded as high-frequency process 
Z. We then ask if X\ and X2 are conditionally on Z independent. To put it the other way round, pairs 
which are not conditionally independent given some other asset exhibit some crucial covariance that carries 
information about the direct mutual influence between the two assets. We understand independence here 
in terms of orthogonal quadratic covariation processes and will further restrict to specific hypotheses as 
below where we test for zero integrated covariances - so the term 'independence' is misused here for a 
simple illustrative phrasing. The role of Z in the model can be also some macro variable that is known or 
can be estimated with faster rate. We set up a statistical experiment in which X\ and X2 are orthogonally 
decomposed in the sum of Z and a process independent of Z. The constants p Xl , p X2 quantify the degree 
of dependence on Z. 

X 1 = p Xl Z + Z 1 - , X 2 = p X2 Z + Z^ with [Z, Z-*-} = , [Z, Z f ] = . (41) 

With [Xi, X2) = for two semimartingales X\, X2 we express that \X\, X 2 ] s = for all s € [0, T]. For 
the conditional independence hypothesis, we set 

H : [Z^,Z^] T = 0. (42) 

Essentially, we do not distinguish between pairs for which the orthogonal parts are uncorrelated on the 
whole line and pairs for which this correlation process integrates to zero. Our focus is on a resulting zero 
covariance over [0, T]. 

A suitable test statistic to decide whether we reject Ho or not is 

1(X 1 ,X 2 ,Z) = [X U Z] T [X 2 ,Z] T -[X 1 ,X 2 } T [Z) T , (43) 

which is zero under H . In our high-frequency framework we can estimate the single integrated covari- 
ances via the approaches considered in the preceding sections. The vital point is to deduce the asymptotic 
distribution of the estimated version 

% n = [X^Z] n T [X2^Z] n T - [X^X 2 ] T \Z] T , (44) 

— 71 

where [ • ] T stands for one of the aforementioned estimators satisfying ( |39] l whichever model fits the data 
best. This test statistic, though based on the simple function g(x, y, u, v) — xy — uv, is more complex 
to analyze than linear combinations, since we face products of our estimators. In lieu of determining the 
distribution of the test statistic directly, we find help in the asymptotic covariance structure provided in 
this paper and the A-method for stable convergence. Here, the methodology is similar to the prominent 
propagation of error concept from experimental science. For each quadratic covariation, the estimation 
error gets small for large n and hence we can profit when we taylor the underlying function g. Indeed, this 
will give us the leading term of the variance of X n : 

X - T n = [X 2 , Z] T ([X u Z] T - \X^Z] T ) + [X U Z] T {[X 2 , Z] T - Mf) (45) 

- [X U X 2 ] T {[Z] T - [?£) - [Z} T {[X ll X 2 } T - [X^X*\l) + O p (r- 4 ) . 

The asymptotic variance of the test statistic is random as we are familiar with from the usual mixed normal 
limits in this field. It is a linear combination of the unknown covariations we estimate and the asymptotic 
covariances which are the topic of this article and can be estimated consistently as found in Proposition 
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|6.1| An elementary calculation yields 

AVAR(T„) = [X 2 ,Z] 2 T AVATL{[X^Z] T ) + [X u Z]% AVAR([5p?£) 
+ [X u X 2 ] 2 t AVAR([i]y) + [Z] 2 T AVAR([X^X 2 ]l) 

+ 2 [Z] T [X 1 ,X 2 ] T kCOV({X^X 2 ] T , \zf T ) + 2 [X u Z] T [X 2 , Z] T ACOV([X u Z] T , [X 2 , Z] T ) 

- 2 [Xi, Z] T [Z] T ACOV([xT^ 2 ]J, ) - 2 [ X 2, Z] r [Z] T ACOV([jCx 2 ]x, 

- 2 [X 1 ,X 2 ] T [X 1 ,Z] T ACOV([x7z]J, - 2 [X 1; X 2 ] T [X 2 ,Z] T ACOV([x7^, 
Inserting consistent estimators for the asymptotic covariances above, we finally obtain with <[39]> that 



\\r n ^Jk ^N(0,1), (46) 

AVAR(T n )/ 

where Po denotes the distribution under Ho and an asymptotic distribution free test for Ho. 
The conditional independence test also provides a tool for investigating dependencies within vast dimen- 
sional portfolios in order to choose small blocks, i. e. subportfolios for which covariances of the estimates 
are quantified explicitly. 

8 An empirical example 

We survey our methods in an application study on NASDAQ intra-day trading data, reconstructed from 
first-level order book data, from August 2010. We consider a sample portfolio with 5 assets, namely Apple 
(AAPL), Microsoft (MSFT), Oracle (ORCL), Exxon Mobil Corporation (XOM) and Pfizer (PFE). We 
quantify the complete asymptotic covariance matrix of generalized multi-scale estimates with weights ( fT6) i 
for the integrated covariance matrix over the whole month (where we discard over-night returns) and for 
the first trading day, 2010/08/02. The data provides a good training data set to analyze estimation and test 
procedures. 

For a p-dimensional portfolio, the number of free entries of the symmetric covariance matrix is given by 
lp{p+l) (p{p + l) , Jp\ , 3 .JP\ + JP\ (4?) 



2 2 V 2 J \V \V V 2 , 

The left-hand illustration is derived by the reasoning that we estimate p(p + l)/2 different entries of the 
symmetric integrated covariance matrix which leads to a (p(p + l)/2) 2 -dimensional covariance matrix 
which is symmetric again. Since the dimension of the asymptotic covariance matrix increases proportional 
to p 4 , an evaluation of all covariances is tractable only for small values of p (our estimates of risk in a 
p-dimensional portfolio have dimension proportional to p 2 and the risk of the estimates proportional to p 4 ). 
In Table [3] we list the estimates for integrated covariances ± estimated standard deviation. With the esti- 
mators ( |38| l and the numbers of refresh times of incorporated components, we quantify covariances of the 
generalized multi-scale estimates which are listed in Table |4] The bottom line is that involving asymptotic 
covariances is indispensable when facing questions for multivariate portfolio management. The estimated 
quadratic variation of a sum of all five assets is (41.57±0.26) • 10" 3 for 2010/08 and (129.22±6.88) • 10~ 5 
for 2010/08/02. The estimated asymptotic variances, 6.93 ■ 10~ 8 and 47.42 ■ 10~ 10 , are mainly induced 
by estimated covariance terms (6.34/42.91), whereas the trace of the matrix, i. e. the sum of estimates for 
asymptotic variances, is smaller. If one would mistakenly act as if the estimators were uncorrected, this 
leads to a tremendous underestimate of the estimation risk. Note that in principle, e. g. for negatively 
correlated assets, the risk could sometimes also decrease by adding covariances. For [MSFT+ORCL], 
which have the highest correlation in our portfolio, the ratios of the estimated sum of variances to the es- 
timated sum of all covariances are 0.14/0.45 and 0.85/3.30, respectively, and for the least correlated pairs, 
[AAPL+XOM] and [PFE+XOM], respectively, we have 0.09/0.20 and 0.84/1 .59. 

We perform the test from Section [7] to investigate three hypotheses: if MSFT and ORCL have a zero co- 
variation conditional on PFE; ORCL and PFE conditional on MSFT and MSFT and PFE conditional on 
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multi 

[Xi,X 2 ] T 


AAPL 


MSFT 


ORCL 


XOM 


PFE 






AAPL 




2.82 ±0.01 


1.23 ±0.02 


1.66 ±0.03 


0.63 ±0.01 


0.99 ±0.03 






MSFT 








2.87 ±0.03 


2.05 ±0.05 


0.94 ±0.02 


1.25 ±0.07 






ORCL 










3.81 ±0.06 


1.31 ±0.03 


1.63 ±0.10 






XOM 












2.36 ±0.01 


1.25 ±0.04 






PFE 














3.83 ± 0.09 




multi 

[Xi, X2\ T 


AAPL 


MSFT 


ORCL 


XOM 


PFE 


AAPL 


8.16 ±0.10 




3.42 ±0.20 


3.74 ±0.20 


2.28 ±0.11 


2.47 ±0.26 


MSFT 






12.43 ±0.38 


7.43 ±0.37 


2.95 ±0.20 


4.69 ±0.47 


ORCL 










11.90 ±0.39 


3.73 ±0.22 


2.90 ±0.49 


XOM 












6.42 ±0.12 


1.75 ±0.29 


PFE 














19.59 ±0.70 



Table 3: Estimates for integrated covariances (• 10 3 ) 2010/08 (top) and (• 10 5 ) 2010/08/02 (bottom). 



ORCL. From the higher correlation of MSFT and ORCL, we may expect that the first will probably not 
apply. We obtain the following three test results (two-sided test) with p-values p and statistics Q which are 
asymptotically standard normal under H : 

Q = -8.10; -2.47; -1.85 p = 0.00; 0.014; 0.064 (2010/08), 
Q = -10.00; -0.05; -1.24 p = 0.00; 0.96; 0.21 (2010/08/02). 

Tests if MSFT and ORCL have zero covariation conditional on the sum of all assets yield p- values 0.13 for 
2010/08 and 0.23 for 2010/08/02. Tests for ORCL and PFE conditional on the sum of the three other assets 
yield p-values 0.19 and 0.99. In conclusion, this empirical evidence suggests that MSFT and ORCL have 
some dependence not explained by a common macro factor influencing all NASDAQ assets. Contrary, we 
can not reject this for several other combinations. Some differences between 2010/08 and 2010/08/02 give 
an heuristic that the portfolio dependence structure is not completely persistent. Though there are some 
limitations where the additive noise model does not perfectly fit the stylized facts of the considered high- 
frequency data as discreteness of returns and zero returns, the approaches developed in this research area 
and advancements in this article provide reliable tools to quantify risk measures and determine confidence 
intervals from high-frequently documented asset prices. 



22 



on co 

00 (N 
O ON 



m NO K 
oo on ■ 

CO rj OO 



NO 00 
00 oo 

co ON ON 



O ON T— O 

oq in r-; on no in 
' t \t ri ci ^' 













co 


oo 




o 
m 


00 
ON 


NO 




on 


ON 


on 


r~ 




in 


co 


on' 


o< 














IN 


IT; 


00 


■* 


CO 


on 


c 




ON 


On 


t- 


NO 


On 


ON 




NO 


pi 


■* 






00 





ONONco[^t--^inoNONNOr~-ONO 



00 O 

Mm* 



on co 

■* ON 
ON 



C 

!-! on 



00 

ON 



inooo^tcoco^HOOcNinr^ 
intNi^-'rt'dtninvi-^r^vd 



co ^t- 

ONOo'^odNdo6^Jy2Nd^ 



ONr-;CN||-^ON— <.ON 



OOCO^^tON^NO^ 

incoON^tr^oqoqcN 



^ ^ os oo m W S 

[ — t — 00 ON NO . . 

\d 1^ ri iri »i ^ 2 



C 00 in in <o 

on oo rn co 
no in no 



NO ON O in CO 
N h io oo q 
vi N t ri h 



H H N 



oono-^onnoou-) 



co co . ON 2 

*rt- t~ l CN ,-t- 



n a o 



« co 



OONTt-^J-lnNOTj-t^OOOOTt 

OSMI^^OfClin^rH^OOO 



o i o o CO 



<N CO in 



r-NOO^t-OOOO^m^CNCNCOCN 

cNinr-ocNNOoococoONt— cono 
odes— idddHOOHO- i 



ON^j-in^tONONiriON^HOOoo 
ooinq^qin^qiiinhoo 



COONr^CO^HCOCONOOCOCO 
OHMHOcitMOvifi 



^H<N^00^H<Ni/-)i/->or, 
o^^fsi^-joor^Nq^Nq : 



on r- in m r- 

no <N co on in 



NO OO ' — 1 
00 NO ON 



o-*(NOin„m<NON 



NO ON 00 ON 
Tf CO On r— 



CO 

ON 



o on in in o r» 
in co o oo oo on on 



on r- no ^ cm ic: 
•n no on oo in . 
— i in on — i on 2 



in 00 ON 00 On 

no r-; on no ON 
d h h d n 



f- ON ON no 
t- On i— 00 

d d h d 



O NO 



23 



Acknowledgements 

The authors would like to thank Dacheng Xiu, Ruey Tsay and Dan Christina Wang for inspiring discus- 
sions. We also thank Johannes Schmidt-Hieber and Till Sabel for a discussion on the relation of estimators 
via quadratic forms for noise smoothing who have worked on the transformations between several of these 
estimators. 

Markus Bibinger was supported by a fellowship within the Postdoc-Programme of the German Academic 
Exchange Service (DAAD) and the CRC 649 'Economic Risk' at Berlin, supported by the Deutsche 
Forschungsgemeinschaft (DFG), and also from the Stevanovich Center for Financial Mathematics at the 
University of Chicago. Per Mykland was supported under National Science Foundation grant SES 11- 
24526. We gratefully acknowledge this financial support. 

The NASDAQ trading data used in Section[8]has been reconstructed from the order book with LOBSTER 
provided by the high-frequency econometrics team at the Chair of Econometrics, School of Business and 
Economics, Humboldt-Universitat zu Berlin, Germany. 

References 

Ait-Sahalia, Y., J. Fan, & D. Xiu (2010). High-frequency estimates with noisy and asynchronous financial 
data. Journal of the American Statistical Association 705(492), 1504-1516. 

Ait-Sahalia, Y., L. Zhang, & P. A. Mykland (201 1). Ultra high frequency volatility estimation with depen- 
dent microstructure noise. Journal of Econometrics, 160, 160-165. 

Andersen, T. G. & T. Bollerslev (1998). Answering the skeptics: Yes, standard volatility models do provide 
accurate forecasts. International Economic Review 39, 885-905. 

Andersen, T. G, T. Bollerslev, F. X. Diebold, & P. Labys (2000). Great realizations. Risk 13, 105-108. 

Andersen, T. G, T. Bollerslev, F. X. Diebold, & P. Labys (2001). The distribution of realized exchange rate 
volatility. Journal of the American Statistical Association 96, 42-55. 

Andersen, T. G, T. Bollerslev, F. X. Diebold, & P. Labys (2003). Modeling and forecasting realized 
volatility. Econometrica 71, 579-625. 

Barndorff-Nielsen, O. E., P. R. Hansen, A. Lunde, & N. Shephard (2008). Designing realised kernels to 
measure the ex-post variation of equity prices in the presence of noise. Econometrica 76(6), 1481-1536. 

Barndorff-Nielsen, O. E., P. R. Hansen, A. Lunde, & N. Shephard (2011). Multivariate realised ker- 
nels: consistent positive semi-definite estimators of the covariation of equity prices with noise and non- 
synchronous trading. Journal of Econometrics 162(2), 149-169. 

Barndorff-Nielsen, O. E. & N. Shephard (2001). Non-Gaussian Ornstein-Uhlenbeck-based models and 
some of their uses in financial economics. Journal of the Royal Statistical Society, B 63, 167-241. 

Barndorff-Nielsen, O. E. & N. Shephard (2002). Econometric analysis of realized volatility and its use in 
estimating stochastic volatility models. Journal of the Royal Statistical Society 64(2), 253-280. 

Bibinger, M. (2011a). Asymptotics of asynchronicity. Technical Report, Humboldt-Universitat zu Berlin, 
URL= |http: //sfb64 9 . wiwi . hu — berlin.de/papers/pdf/SFB64 9DP2011 — 033 . 

[pdfj 

Bibinger, M. (2011b). Efficient covariance estimation for asynchronous noisy high-frequency data. Scan- 
dinavian Journal of Statistics 38, 23-45. 

Bibinger, M. (2012). An estimator for the quadratic covariation of asynchronously observed ito processes 
with noise: Asymptotic distribution theory. Stochastic Processes and their Applications 122, 241 1-2453. 



24 



Bibinger, M., M. ReiB, N. Hautsch, & P. Malec (2012). Asymptotically efficient covolatility matrix esti- 
mation via local likelihoods, preprint, HU Berlin. 

Black, F. S. & M. S. Scholes (1973). The pricing of options and corporate liabilities. Journal of Political 
Economy 81, 637-654. 

Christensen, K., S. Kinnebrock, & M. Podolskij (2010). Pre-averaging estimators of the ex-post covariance 
matrix in noisy diffusion models with non-synchronous data. Journal of Econometrics, 759(1), 1 16-133. 

Christensen, K., M. Podolskij, & M. Vetter (2011). On covariation estimation for multivariate continuous 
ito semimartingales with noise in non-synchronous observation schemes. CREATES research paper 
2011-53. 

Eberlein, E., R. Frey, & E. A. von Hammerstein (2008). Advanced credit portfolio modeling and cdo 
pricing. In Mathematics - Key Technology for the Future, edited by W. Jdger and H.-J. Krebs, Springer, 
253-280. 

Epps, T. W. (1979). Comovements in stock prices in the very short run. Journal of the American Statistical 
Association 74, 291-298. 

Gloter, A. & J. Jacod (2001). Diffusions with measurement errors 1 and 2. ESAIM Probability and 
Statistics 5, 225-242. 

Harrison, M. & D. Kreps (1979). Martingales and arbitrage in multiperiod securities markets. Journal of 
Economic Theory 20, 381^-08. 

Harrison, M. & S. Pliska (1981). Martingales and stochastic integrals in the theory of continuous trading. 
Stochastic Processes and Their Applications 11, 215-260. 

Hayashi, T. & N. Yoshida (2005). On covariance estimation of non-synchronously observed diffusion 
processes. Bernoulli 11(2), 359-379. 

Hayashi, T. & N. Yoshida (2011). Nonsynchronous covariation process and limit theorems. Stochastic 
Processes and their Applications 121, 2416-2454. 

Heston, S. L. (1993). A closed-form solution for options with stochastic volatility with applications to 
bond and currency options. Review of Financial Studies 6, 327-343. 

Isserlis, L. (1918). On a formula for the product-moment coefficient of any order of a normal frequency 
distribution in any number of variables. Biometrika 12, 134-139. 

Jacod, J. (1997). On continuous conditional gaussian martingales and stable convergence in law. Seminaire 
de Probabilities, Strasbourg, tome 31, 232-246. 

Jacod, J. (2012). Statistics and high frequency data. Proceedings of the 7th Seminaire Europeen de Statis- 
tique, La Manga, 2007: Statistical methods for stochastic differential equations, edited by M. Kessler, A. 
Lindner and M. S0rensen. 

Jacod, J., Y. Li, P. A. Mykland, M. Podolskij, & M. Vetter (2009). Microstructure noise in the continous 
case: the pre-averaging approach. Stochastic Processes and their Applications 119, 2803-2831. 

Jacod, J. & P. Protter (1998). Asymptotic error distributions for the euler method for stochastic differential 
equations. The Annals of Probability 26, 267-307. 

Mykland, P. & L. Zhang (2012). The econometrics of high frequency data. Proceedings of the 7th Seminaire 
Europeen de Statistique, La Manga, 2007: Statistical methods for stochastic differential equations, 
edited by M. Kessler, A. Lindner and M. S0rensen. 

Mykland, P. A. (2012). A gaussian calculus for inference from high frequency data. Annals of Finance 8, 
235-258. 



25 



Mykland, P. A., N. Shephard, & K. Sheppard (2012). Efficient and feasible inference for the components 
of financial variation using blocked multipower variation. Technical Report, University of Oxford. 

Mykland, P. A. & L. Zhang (2009). Inference for continuous semimartingales observed at high frequency. 
Econometrica 77, 1403-1455. 

Park, S. & O. Linton (2012). Estimating the quadratic covariation matrix for an asynchronously observed 
continuous time signal masked by additive noise. FMG Discussion Papers 703. 

Renyi, A. (1963). On stable sequences of events. Sankhya: The Indian Journal of Statistics, Series A 25(3), 
293-302. 

Rootzen, H. (1980). Limit distributions for the error in approximations of stochastic integrals. Annals of 
Probability 8, 241-251. 

Ross, S. M. (1976). The arbitrage theory of capital asset pricing. Journal of Economic Theory 13, 341-360. 

Xiu, D. (2010). Quasi-maximum likelihood estimation of volatility with high frequency data. Journal of 
Econometrics 159, 235-250. 

Zhang, L. (2001). From Martingales to ANOVA: Implied and Realized Volatility. Ph. D. thesis, The 
University of Chicago, Department of Statistics. 

Zhang, L. (2006). Efficient estimation of stochastic volatility using noisy observations: A multi-scale 
approach. Bernoulli 72(6), 1019-1043. 

Zhang, L. (2011). Estimating covariation: Epps effect and microstructure noise. Journal of Economet- 
rics 160, 33-47. 

Zhang, L., P. A. Mykland, & Y. Ai't-Sahalia (2005). A tale of two time scales: Determining integrated 
volatility with noisy high-frequency data. Journal of the American Statistical Association 700(472), 
1394-1411. 



26 



APPENDIX: PROOFS 



A Proofs 
A.l Preliminaries 

The local boundedness condition in Assumption [T] can be strengthened to uniform boundedness on [0, T 



by a localization procedure carried out in Jacod ( 2012| l, Lemma 6. 6 of Section 6 .3. Let C be a generic 



constant and denote AiW — Wjj-i n — W \i~i)T / 'm i — 1> • • • > n > f° r the Brownian motion W driving the 
SDE with solution X in ([TJ and A^ct = <r i2 y n — o , u—i)T/n- Consider some norm || • ||, e. g. the euclidean 
norm, on R p . Suppose As sumption [T] holds. By several applications of the Burkholder-Davis-Gundy and 
Holder inequality one can obtain the following estimates: 

E [\\AiXf + ||AiW|| 2 |.F (i _ 1)/n ] < Cn-\ E [||A l( r|| 2 | F {i - 1)/n ] < CrT 1 , (A.la) 

E [|| A,* - A 4 VK|| 2 |j- (j _ 1)/n ] < Cn- 2 , (A.lb) 

for equidistant observation schemes. For general synchronous sampling ( |A.la i and ( |A.lb[ ) remain valid 



when replacing n by 5 n 1 with S n = sup i (ij — The estimates |ATa]i and 



A.lb| ) are proven in 



Jacod 



(2012 1, among others. They are used repeatedly in the analysis below. We will as well make use of the 
direct extensions for increments with respect to subsampled time lags and the componentwise versions of 
(jAJaJ and flATb) , 



Recall Q, which is a special case of the result by Isserlis ( 1918| l and used implicitly throughout the proofs 



below. For notational convenience we write again a n ~ p b n if a n = O p (b n ) and b n = O v {a r , 

A.2 Proof of Proposition |2.1| 

Applying |6) and ( |A.lb| i, by the Ito isometry we deduce for the random conditional covariance 

(n n \ n 

^A^^A^^A^A^ 4 ) ~p ^Covi-i (A^A^lA^XX^ 
i=l i=l / i=l 

CT (13) (24) (14) (23) 



i=l 
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where we write Cov;_i( • ) for Cov( ■ |.F(i-i)T/n) ar, d Covs( ■ ) for the random covariance dependent on 
S. By convergence of the Riemann sum to the integral, this yields 

f^AiX^AiX^^AiX^AiX^) -AT jT ( (J (13) (J (24) +(7 (14) ff (23 ) ^ dfi 



□ 



A.3 Proof of Theorem O 

For the proof that 

(rtiulti) (kernel) 



- \ iuuiiLt — — — — I Kernel I \ / v 
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if = h in ( fTOj i, it suffices to focus on the first-order term of the weights. Transforming (|9) yields 



M„ 



M„ 



n iA(j-l) 



E - E A*y«Ajy( 2 ) = E a, E E 1 - ' (Aj-yt^Aj-.jyW + A.y^A^yW) 



»=1 J=» 



i=l \ j=2 1 = 1 

n 

+ EA J y (1) A J y( 2 )-i?„ 

M n n M„ , . s 

E E E^( 1 -D (A ^ F(1)A ^ y(2)+A ^ (2)A ^ F(1)) 



+ EA J y (1) A J y( 2 >-i?„ 

where we suppress additional component superscripts on M„. The term i? n induced by end-effects 

M„ 



»=i \j=i 
+ E 

j— n — i+2 



(i-i)Ai . 



£ ^( Ajr (i) Aj _ ; y(2) + A^A^^ 1 ) 



z=i 



i — n + j — 1 



i/\{n~j) 



A J y( 1 )A J y( 2 )+ E (A^y^A^y^ + A^A^yW 



has an expectation by noise: 



V12 E a - ( E 



i— 1 . . i— 1 . . 7i— 1 . . n— 2 

i — j x -» i — j ^-^ i — n + j ■t—v i — n + j 



i *■ — ' % 



+ E 



j=n— 



E 

j=rt— i+1 



= 4?7 



12 ■ 



The variance of this term is negligible which can be shown with standard bounds. For the main term above, 
we can detach the inner sum and find that 



5> i- =£ 



^"(x) [a; 



Mr, 



dx 



o(n-V«) 

+o (»-")=Ki) + 



e> n 



by partial integration under the restrictions made on K. This yields the form ( fTTj ) of the transformed kernel 
estimator and our claim. That the integral approximation does not harm the above equality up to the 
o (n~ ' 4 )-term, can be seen by the estimate 



(i+l)/M„ 



/(*)-/ 



-f( 



I 



dx < 



(i+l)/M„ 



c 



i/M n 



dx < CM: 



with generic constant C,i > I, for the Lipschitz function 



f{x) = &"{x) X 



on the compact support [0,1], where Lipschitz continuity is ensured by the preconditioned continuous 
differentiability. 
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A.4 Proof of Proposition |3.1| 

Decompose the multi-scale estimator Q for [X^ k \X^] T , (k, I) £ {1, 2, 3, 4} 2 



Ml 



M: 



Ml 



(fe) 



i=l j=i 



M< fei > 



i=l j=i 



M; 



^^A}X«A} e W+ £ -E A i e! ' )A i lW ' 



i=l j=i 



with A^e^ analogously defined as for X"\ in a signal part, a noise part and cross terms which are uncor- 
rected for each component. We analyze the covariances which stem from covariances of the signal, noise 
parts and cross terms consecutively. The elements of the proof of a stable central limit theorem is similar 
as in Zhang (2006) and Bibinger (2012), respectively, and hence we restrict ourselves to the evaluation of 



the general covariance including possibly different frequencies and weights for equidistant sampling. 
We can write the signal part of the addends of [X W , X^]t in the way 



il2AZ 



lb it, ib " 1Z' / 

— E a; i2 * (1) a; i2 * (2) = X>3* (1)a 3* (2) +E a < x(1) E f 1 



3=»12 



3=1 



1=1 



il2Ai 



3=1 



'12 



3=1 



'12 



The first addend is the usual realized covariance and will contribute only asymptotically negligible covari- 
ance terms in the noisy setting with slower convergence rate. We derive that 



, Vs [JLy A!. 12 X«ArX< 2 \ — y Af 4 X {3} Af 4 X {i) I ~" 
\ «i2 Z — Jj J ^ J 

\ J=H2 3 = »34 / 



Tin i pn i pn , pji 
1 13 "T 1 14 T 1 23 "r 1 24 > 



with conditional covariance terms 



i=i 



n min (ii2, 134,0 , • \ / 

rr 3 = ^ E [A i x( 1 )A ; x( 3 )] £ l 1 "^)! 1 

3=1 1 1 



^7 
in 



?34 



n min (ii2,i34,i) / • \ / ■ \ 1 



'12 



and analogously for the other addends. The smoothness of S ensured by Assumption [T] giving the bound 
in ( |A.la[ ) suffices that we can approximately take on_x)Tln m a ^ addends of the second sum. Hence, for 
the equidistant case the analysis boils down to simply evaluating the deterministic inner sum: 



£ 1 

3=1 V 



'12 



1 - 



'34 



1 1 



rn 



m m 
2~ ~ 6M ~ 8 + 12M ~ "6 



3- 



M) ' 



where m = min (112,134) and M = max («i2, £34). For M n = min [m^\ Mi 34) ^j denote the limit of 
the following series including weights according to ( fT0] i 



M„ k 



5>« = lim M^J2J2 



fc=l Z = l 



6M„ 



This leads to the result 



n 

M 



M„ „ T 

«12, 134 = 1 
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Similar results hold for T" 4 , T!^, T^. Thereby we conclude the signal term of ( fl7] >. T) a is a constant 
showing up in the asymptotic discretization variance depending on the weights, where for the standard 
weights (jT6]» or cubic kernel D a = 13/70. Eventually, we also obtain the general form of the asymptotic 
discretization variance of the one-dimensional estimator. 

The addend induced by market microstructure noise in the multi-scale estimator |9]l is 



M^> n Ml 



i=i j=i i=i \ j=i 



(1)J2) 



i-l 



_ y f e (i) e (2). + e (2) e (D. _ y e <i) e (2) _ y e (D e ( 

\ 3 3~ % 3 Z-^i 3 3 ^ I J 

j=» j=n— i+1 j=0 

The last two sums lead for the non-adjusted multi-scale estimator |9]l to the negative bias by noise and end- 
effects. We have focused on the bias and an adjusted corrected estimator in Section[3]and will concentrate 
on the adjusted non-biased version in the sequel. The first inner sum on the right-hand side does not 

M (12> 

depend on i and vanishes since zZi=x a i/i = 0- The covariances of the remaining uncorrected addends 
contribute to the total covariance due to noise perturbation. Denote the constant limits 

M„-l / M„ \ 2 M„ 2 

m% = lim M„ V V — and 9t« = lim MlY % 



I j n— >CO ' %^ 

j = l \i=j + l I j=l 



and rewriting 



Af< 12) » n M< 12 >Ai 

__ srJ 1 )^) - __ e (i) e (2). 

n ' J J J " ' J ' j 2 ^ 3 ' 



EjE«=E E 

i— 1 j— z j— i i— 1 




) e (2) +e (1) . e (2) . 

3 n-3 c n-j 



JV^ 12 > 



E 



, 2 

j=0 j=n-s.+ l / j=0 »=i+l 



we obtain with M„ = min MA ; , M, j that 



in^/^M^) 



M< 12 > „ M< 34 > 

**» E ?E«f#i. E lE'f'S'i] ^vt™*., 

=1 j=i i=l j=i 



and that 



'a/< 12 ' / n 4 _i M< 34 > 



i-l 



WE? E W+EWEt E 4 3 M 4) +E4 3 M 

V i=l V j— n— i+l j— i-l j = 7i— i+1 J = 
► 0^ (??13?724 + %4%s) ■ (A.4) 

The analogous implications for all addends contributing to the total covariance by noise lead to the noise 
parts in ( fTTl). For (fT6l), w e have Ot™ = 6/5 and 91" = 12, which gives the minimum of the variance due to 
noise (see Zhang (2006 1). Note that for the weights from ( fT0| i, the respective frequencies M„ , 1 < k < 
4, 1 < I < 4, are inserted, but we leave out further indices for a better readability. 
Finally, consider the cross terms of Q. They can be decomposed in addends of the form 

EtE«M° (a.5) 

i=l j=0 
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for components (k, I), where 



-A! 



,0< j < (i-1) 



= <| Ap( fc ) - A} +i XW ,i<j<(n-i) 

A}X^ 



, n — i + 1 < j < n 



The asymptotic covariance coming from cross terms, by the martingale structure of the above term can be 
deduced by the limit of the sum over j covariances conditionally on Ft \J cr (e;, I < j — 1). We thus have 



/ M (12) 



A/< 34 ' 



M< 12 > 



M< 34 > 



(4) 



j=l j=0 



i=l j=0 



j=0 



i=l 



Now, similarly as in Zhang (2006) and Bibinger (2012), if we assume without loss of generality 1 < iia < 

«34 < M n = min (m£ 2) , Mi 34) J , it holds that 



Jk) M) 



^ A r X«- £ A r X« 



J— 1 134+j-l 

\r=j-i34 r=j 



(j'-iia-l !34+i-l \ 

£ A r X«- ^ A r X« fc = l,2;Z = 3,4. 
r=j— »34 r=j+i 12 ) 

The conditional covariances from these addends yield for u, k,l,v £ {1, 2, 3, 4} 



'M< 12 > „ M< 34 > 

M.covJE^Ec'iM-'.EfEcS^ 



i=l j=0 



i=l j=0 



»=1 r=l y V ' J = (iAr) / 

-Aot^Jx^xW" . 

L J T 

For the specific weights ( [To*] ), the constant takes the value 9Jt Q =6/5. This leads to the total covariance by 
cross terms completing the proof and we conclude Proposition (37TJ □ 



A.5 Proof of Proposition 4.1 



Recall the illustration ( |20| ) of the Hayashi-Yoshida estimator with refresh times and interpolations. We first 
consider the term D T 2 D T 4 from the product of the two synchronous-type approximations. Only overlap- 
ping increments contribute to the asymptotic covariance: 



E [Drj? Dp] = £ 1 {max {Tl\,Tftx 



)<min(T 12 ,T 34 )} 



x E 



^ T- 12 T 12 J y T 12 T 12 y \^ T 34 ^7^34 y 1 \^ T 34 ^7^34 



and thus 



Cov 



N 

(D^D?) = J2 Cov ((xg> - xg>J (xg> - xg>J , (xg> - xg> J (xg> - xg> , 



Only on intervals where increments of all four processes coincide, the expectation of the product with four 
factors does not equal the product of expectations. For that reason the refresh times Si, i = 0, . . . , N, of all 
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processes give the crucial synchronous approximation here. The other remaining addend is the covariance 
of the terms due to non-synchronicity for each pair A^A^fi. We have that 



maxC^T^),*-^),*- (T 34J it ~ (T 34 i))<min(t + (Ti L2 )t + (Ti L2 )it + (T 34 ), t +(T/*))} 



E[A?AP]=J2\ 

X Cov ((X^i2 + ~ -^T^ 2 4 , J 2 1 ~ ^T>\ ) {^T> 2 + ~ ^T> 2 ^T.^j _ ^T 1 ^ 
/ y (3),+ _ v (3) y(3) _ v(3) ,-\ / v-(4),+ _ y(4) y (4) _ y 



As explained in Section[3] we can arrange the coinciding interpolation terms using the structuring generated 
by the S^s. Consider now the sequence of covariances conditionally on Fsi-it i = 1, ■ ■ ■ ,N, and its sum. 
For the first term with the synchronous approximation this is 



E ((*£> - 41) (4? - 41) - (4? - 41) (4? - 41) 1^ 

AT 

i=l 
JY 



i=l 

AT 



^< 13 ' rr( 24 ' J- rr( 23 ) ^ (G(^) - G(g t -l)) 

2^ \ a St-i a Si-i + (T S l - 1 (T S l - 1 ) g. _ s _ \ b i - b i~V 

i—1 1 1 

and analogously for Cov(A^ , Aj^) with all interpolations. The difference quotients will converge on 
As sumption [3] and the Riemann sums to the corresponding integrals. With all functions from Definition [T] 
and Assumption[3j we obtain 



A/rWw T ( ( (13) (24) . (14) (23) \ (G(Si) - G(Si-i)) . . 

ACOV =p- hrn^ ^ + ^sJAJ,) g. _ g._ (St - S^) 



N 



V (13) (24) (F^-F^-PH^SO-if^^ q v 



i = l 
N 



Si Si— 1 

V- d4) (23) - 

i=l 1 / 

and this limit in probability of the above sum equals ( |24"| >. □ 

A.6 Proof of Proposition |5.1| 

Consider four possibly different observation schemes, but also allowing for subsets of synchronous sam- 
pling times. We use the notation from Section[3]for interpolations and refresh times. For the error due to 
noise dA. 3b becomes 



Ml 
N 



>i2 m£ 2) aU+i) 



j ^ ov 2^ 2^ ^ K+ (T i 2) %- ( ^2 i) +e t + (T i 2) e ir(T/fi) I 
\i=o i=i x 7 



(??13?7246 24 G: 24 + 77l4??236? 3 e 



35) 



on the assumption that the limits in Definitionjijexist with £ 24 , = rimAr_ i . 00 Mf^ Y^kLi ^ff^ over a ^ (*> 
for which the corresponding indicator functions in the Definition[3]equal one. It holds that max (£ 24 ., £ 23 ) 
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< Wti by the Cauchy-Schwarz inequality and that each observation point can only equal one of the other 
scheme. The generalization of ( |A.4[ ) incorporating general observation schemes is 

Mn — 1 Mn 



WCov V V — e (1) e (2) +e (1) e (2) 



j=0 »=j+l 



t"(2 



A/jv — 1 Mjv 

E E " 

3=0 i=J+l 



"12 -f' * 2 ^»12-J' 1 



.(2) 



04 II) 



, e (2) 



Again we have max ^£ 2 |, £ 2 |^ < OT^i i- e- constants smaller or equal the ones in the synchronous setting. 
For the analysis of cross terms we can build on the findings from above and the rewriting ( |A.5| l. Directly 
ignoring interpolations we set 



,a<j< (i-i) 



A)XW - A) +l X^ ,i<j<(n-i) 
A}X( fe ) ,n-i+l<j <n 



with A)XW = {X { *1 - X!& ) for k = 1, 2, and A)X ( ^ = ( X^i - X}& ) for k = 3, 4, here. The 



-CO 



T 34 



conditional covariance of the cross terms ignoring asymptotically negligible interpolation steps yields 



EE E 

j=0 fc=0 



e (D £ (3) 

t« t< 3 > 



A/ 



A/ ' 



1 {t <1) =t< 3) ^ 



> E t« } E ?^ 



«i I. 



(4) 
k 



i=l 



and 



?(2) ?(4) 

'«12,i^i34>fe 



J — 1 »12+3 — 1 

53 A r x( 2 )- ^ A r X< 2 

,r=j— ii2 r=j 




53 A r X 



(4) 



(j'Afe) — 1 min (ii2+J — l,i34 + fe— 1) 

53 A r X< 2 )- ^ A r x( 2 ' 

ir=max(j'-ii2,fc-i34) r=(j'Vfe) 



(jAfe) — 1 min («i2+j— l,iM+fc— 1) 

.r=max (3— ii2,fc— 134) r=(jVfc) 



Compared to the synchronous case this product includes refresh time instants and in general j 7^ k, but if 



(1) _ + (3) 



t k holds, locally Q 12 jQ k estimates J A a 14 ds, only that here the interval A is determined by 
(j, k) and 134)- Taking the sum for the multi-scale estimates, we end up with the same approximation 
as above, but including the limiting function of S[ 3 to describe the proportion of synchronous times 



<M 



(12) 



A/ 



TI4 

^ I E tE2X'. E tESV" 




E 



A (*Ar) y(2) A (iAr) y ( 4 ), 



3\k— (iAr) 



The other addends are treated analogously. 

It remains to show 06} for the discretization term, in most situations the only non-vanishing term in the 



asymptotic covariance. Proposition 5.1 and Corollary 5.2 state that the asymptotic covariance equals the 
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one of the experiment in which we observe all processes synchronously at times 5j,0 < i < N. As a 
first step of the proof it can be shown that the discretization error induced by interpolations to refresh times 
Tl 2 ,0 < i < N%2, and Xf 4 ,0 < i < -ZV34, for each pair is asymptotically negligible. This is proved 



analogously as Proposition A. 10 in Bibinger (2012 1 and we omit it here. The second step would be to 



show that the covariances between an approximation error where refresh times of one pair are interpolated 
to the refresh times Si, i = 0, . . . , N, of all four processes and the other multi-scale estimate tends to zero 
in probability. As discussed in Section[3]these terms have non-zero expectation, but do not contribute to the 
covariance which already gives the result. Alternatively, the conclusion can be thoroughly comprehended 
by adapting ( |A.2[ ) to the two refresh time schemes after the first approximation step. All addends with 
overlapping refresh time instants contribute to the sum and by the Ito isometry and approximation |ATb]i, 
the single addends are of the form 

(min (If ,Tf ) - max (If ^if^)) . 

Indeed, the interpolation terms fall out of the sum and in consequence the proof boils down to the syn- 
chronous setup after passing to the refresh times proxy. Yet, we still need to develop the severally interest- 
ing asymptotic distribution theory for irregular non-equidistant synchronous sampling. Compared to Zhang 
(2006) and Bibinger (2012 ), we forego the assertion that the observation scheme is close to equidistant such 
that the approximation in ( |A.2| i is still valid with an asymptotically negligible approximation error. 
Generally, with the limiting function 



M N rAiAk 



L N — 

S r <t i,k=l <?=0 



r-q , 



which exists under Assumption[3](i) and if the difference quotients also converge uniformly under Assump- 
tion [3] (ii), we obtain in place of |A.2[ ): 

Ar Mn at NASi Mn lAii 2 Ai a4 . / \ / \ 

l,k = l ( = 1212,234 = 1 9 = 

i-T 



f a^af^{D a )' [a) da. 
Jo 



The existence of 



N 



rAiAk 



(iAk)T 

v ; s r <t 9=0 

is ensured on Assumptionblby dominated convergence and the convergence of M^ 1 ^2 ai«fc(i A k) on the 
conditions for the weights (10]). For the weights (jT6j) and the cubic kernel M^ 1 ^ 0^0^(1 A k) ~ 38/70. 
The above finding is valid in the same fashion in the one-dimensional setting where 



M n rAiAk 

^ = il^fE ^ £ cua h £ (l - f ) (l - f ) At. 

t r <t i,k=l 9=0 



1 



The explicit formula for the weighted local sampling autocorrelation is useful to analyze specific examples: 

wLSA[ fe = n {{At r ) 2 + (l - i Vl - ^ At r At r _! + ^) Ai r Ai r _ 2 + . . . 

if i A k\ \ 

1 - — r At r At MAk . (A.6) 



i A k \ i V fc r 

Since the first addend is negligible as n — > 00, ( |A.6| ) has a simple nature for i. i. d. random sampling 
independent of Y as for the equidistant setup. Analogous reasoning for the other addends contributing to 
the discretization variance leads to ([36]) which completes the proof of Proposition|5.1| □ 
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A.7 Proof of Proposition |6.1| 

Consider the estimator ACOV from ([37} for the asymptotic covariance |4]) of integrated covariances. On 
As sumption [T] we have 



E 



ACOV 



n-l 



i=l 
n-l 

E E 

-y r 

T ^ / (i-l)T s 



(i+l)T (i + l)T 



8 = 1 

i-l „*E 



(i+l)T (i + l)T 

af) dW< 2 > I " CT (3) dW (3) + / " a W dW W f " ^(4) dW (4) 



i=l 
n-l 



I! 

n, 2 



(24) (23) (14)\ 

i=1 \ n n » / 



n-l 



V- f (23) (14) (13) (24) 

/ y 1 u (i-l)T " (i-l)T " (i-l)T " (i-l) 











/ n 





with ( |A. 1 a| >, ( |A.lb| > and Ito's isometry. The variance is of order what can be seen e.g. by bounding 

the second moment by Burkholder-Davis-Gundy inequality, which implies consistency. 

A histogram-wise approach as ( |3~8] l proposed for the general setup is typical for these kind of methods and 

consistency can be proved similar as for the asymptotic variance estimator in Proposition 5. 1 of Bib inger| 

( l20T2] l. 
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